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CHLAMYDIA PNEUMONIAE GENOMIC SEQUENCE AND POLYPEPTIDES. 
FRAGMENTS THEREOF AND USES THEREOF. IN PARTICULAR FOR THE DIAGNOSIS. 
PREVENTION AND TREATMENT OF INFECTION 

5 

The subject of the invention is the genomic sequence and the nucleotide sequences 
encoding polypeptides of Chlamydia pneumoniae, such as cellular envelope polypeptides, which are 
secreted or specific, or which are involved in metabolism, in the replication process or in virulence, 

1 0 polypeptides encoded by such sequences, as well as vectors including the said sequences and cells or 
animals transformed with these vectors. The invention also relates to transcriptional gene products of 
the Chlamydia pneumoniae genome, such as, for example, antisense and ribozyme molecules, which 
can be used to control growth of the microorganism. The invention also relates to methods of 
detecting these nucleic acids or polypeptides and kits for diagnosing Chlamydia pneumoniae infection. 

15 The invention also relates to a method of selecting compounds capable of modulating bacterial 
infection and a method for the biosynthesis or biodegradation of molecules of interest using the said 
nucleotide sequences or the said polypeptides. The invention finally comprises, pharmaceutical, in 
particular vaccine, compositions for the prevention and/or treatment of bacterial, in particular 
Chlamydia pneumoniae, infections. 

20 Comparative analysis of the sequence of the gene encoding the ribosomal 16S RNA has 

been widely used for the phylogenetic study of prokaryotes. This approach has made it possible to 
classify the Chlamydiae among the eubacteria, among which they represent a well-isolated group, 
with, nevertheless, a very weak link with the planctomyces. The Chlamydiae thus exhibit some unique 
characteristics within the eubacteria, in particular their development cycle and the structure of their 

25 membranes. They have a unique two-phase cell cycle: the elementary body, a small extracellular form, 
attaches to the host and is phagocytosed; in the phagosome, it is converted to the replicative 
intracellular form, the reticulate body. The Chlamydiae are obligate intracellular bacteria which 
multiply in eukaryotic cells at the expense of their energy reserves and nucleotide pools; they are 
responsible for a wide variety of diseases in mammals and birds. The Chlamydiae are the only 

3 0 members of the order Chlamydiales, of the family Chlamydiaceae and of the genus Chlamydia. Within 
the genus Chlamydia, four species are currently described: Chlamydia trachomatis, Chlamydia 
psittaci, Chlamydia pneumoniae and Chlamydia pecorum. These bacteria are grouped together and 
share biological and biochemical properties. Among them, only the first three infect humans, 
Chlamydia pecorum being a pathogen of ruminants. 

35 The species Chlamydia psittaci infects many animals, in particular birds, and is 

transmissible to humans. It is responsible for atypical pneumonia, for hepatic and renal dysfunction, 
for endocarditis and for conjunctivitis. 
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The species Chlamydia trachomatis is the best characterized. Besides a murine strain, 
it is divided into two groups which are distinguishable by the nature of the diseases for which they are 
responsible: trachoma, genital attack and venereal lymphogranulomatosis. There are fifteen human 
serotypes of Chlamydia trachomatis (A, K) and LGV (LI, L2, L3). Strains A to C are mainly found in 
5 eye infections, whereas strains D to K and LGV are essentially responsible for genital entry infections. 
It should be mentioned that the LGV strains are responsible for systemic diseases. Historically, it was 
in 1906 that Halberstaeder and VonProvaseck discovered, in trachoma patients, the presence of 
inclusions in the cytoplasm of the cells derived from conjunctival scrapings. In 1940, Rake and Jones 
described these same inclusions in certain cells obtained by puncturing the ganglia from a patient 
10 suffering from venereal granulomatosis. Characterization of the Chlamydia trachomatis 
microorganism was only successfully carried out in 1957, after a series of isolations in cell cultures. 

It was in 1983 that Chlamydia pneumoniae was recognized as a human pathogen 
(GraystonJT etal., 1986); since then, special attention has been paid to this bacterium and it is 
estimated (Gaydos CA et al., 1994) that 10% of pneumonias, and 5% of bronchitides and sinusites are 
15 attributable to Chlamydia pneumoniae (AldousMB etal., 1992). More recently, the association of 
this bacterium with the pathogenesis of asthmatic disease and of cardiovascular impairments is 

increasingly of interest. 

Serological studies have made it possible to observe that Chlamydia pneumoniae 
infection is common in children between 5 and 16 years of age. Before this age, it is rare to find 
2 0 antibodies; the increase in the number of individuals carrying antibodies is then correlated with age up 
to 20 years. Accordingly, 50% of adults are carriers of antibodies, it being possible for this prevalence 
to be as high as 75%. These figures are all the more striking since a first infection induces antibody 
levels of which the persistence over time is limited to 3 or at most 5 years, which suggests frequent 
reinfection during the entire lifespan. The annual seroconversion rate is about 8% between 8 and 
25 12 years and about 6% between 12 and 16 years (Haidl et al, 1994). Before the age of 15 years, the 
seroprevalence of the disease is identical between both sexes. After this age, men are more frequently 
infected than women; this is true in all regions worldwide where such studies have been carried out. 

These infections are geographically highly widespread, as shown by numerous studies 
carried out throughout the world (KanamotoY etal., 1991; TongCY etal., 1993). Developed 
30 countries of the north such as Canada, Denmark and Norway have the lowest infection rates; 
conversely, the highest prevalence rates are found in the less developed countries of tropical regions 
where the infection may occur before the age of 5 years. 

Humans are the only known reservoir for Chlamydia pneumoniae and it is probable that 
the infection is caused by direct transmission, respiratory secretions probably being responsible for 
35 this low-yield transmission (Aldous et al., 1992). The chain of transmission may also appear to be 
indirect (Kleemola M et al., 1988), suggesting that the infection is caused by an effective transmission, 
but also that asymptomatic carriers exist, which could explain the high prevalence of the disease. 
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Other studies (Mordhorst CH et al., 1992) show that the efficiency of the transmission varies 
according to the individuals and list cases of infection affecting all or the majority of members of one 
family or of a group of families. The period of incubation is several weeks, significantly longer in this 
regard than that of many other respiratory pathogenic agents. Although under conditions of high 
5 relative humidity the infectivity of Chlamydia pneumoniae in the open air decreases rapidly, 
suggesting a direct mode of transmission under these conditions, it is probable that the transmission 
occurs in some cases indirectly since the microorganism can survive for up to 30 hours in a hostile 
environment (Falsey et al., 1993). 

Clinical manifestations due to Chlamydia pneumoniae are essentially respiratory 

10 diseases. Pneumonia and bronchitis are the most frequent because they are clinically patent: since 
etiological diagnosis is evoked in this case, the infectious agent is identified. The asymptomatic 
diseases are probably numerous (Grayston JT et al., 1992; Grayston JT et al., 1986; Thorn DH et al., 
1990). The disease then progresses via bronchitis or pneumonia; fever is absent at the time of 
examination but is sometimes reported by the patient. The degree of seriousness of the disease is 

15 variable and in hospitalized patients, it is common to observe pleural effusion; a generalized infection 
may also be observed and, in severe cases, anatomicopathological examination shows Chlamydia 
pneumoniae diseases. 

Other syndromes such as sinusitis (Hashiguchi K et al., 1992), purulent otitis media 
(Ogawa H et al., 1992), or pharyngitis (Huovinen P et al., 1989) have been described, as well as 
20 infections with respiratory impairments similar to asthma (Hahn DL etal., 1991). Chlamydia 
pneumoniae has also been associated with sarcoidosis, with erythema nodosum (Sundelof et al., 1993) 
and one case of Guillain-Barre syndrome has even been described (Haidl et al., 1992). The 
involvement of Chlamydia pneumoniae in Reiter's syndrome has also been evaluated (Braun J et al., 
1994). 

25 The association of Chlamydia pneumoniae with coronary diseases and with myocardial 

infarction was first suspected from the observation of the high antibody level in 71% of patients 
having a heart disease (ShorA etal., 1992; Kuo CC etal., 1993; Puolakkainen M etal., 1993; 
Thomas GN et al., 1997). Studies carried out in several countries have shown similar results in 
patients with atheromatous impairments (Shor A et al., 1992; Kuo CC et al., 1993; Puolakkainen M 

30 et al., 1993; Grayston JT et al., 1996; Casas-Ciria J et al., 1996; Thomas GN et al., 1997; Jackson LA 
etal., 1997) and in patients with carotid impairments. Anatomicopathological and microbiological 
studies have detected Chlamydia pneumoniae in the vessels. The electron microscope has made it 
possible to visualize the bacterium (Ladany S et al., 1989), which has in fact been demonstrated by 
other techniques such as PCR (Campbell LA et al., 1992; Kuo CC et al., 1993; Kuo CC et al., 1988). It 

35 also appears that the bacterium is more frequently found in old atheromatous lesions. Other studies 
carried out on young subjects from 15 to 35 years have given the opportunity to study the coronary 
arteries of people without atherosclerosis, this observation not being possible in older subjects (the 
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onset of the atheromatous disease is early). In these young subjects, the PCR studies did not find 
Chlamydia pneumoniae in subjects free of atheromatous disease, but revealed the presence of 
Chlamydia pneumoniae in two of the eleven subjects who showed early lesions and in six of the seven 
subjects who developed atheroma plaques. These studies therefore show that the atheroma plaque is 
5 very strongly correlated with the presence of Chlamydia pneumoniae, but the role played by the 
bacterium in vascular pathology is not yet defined. 

The data relating to controlled clinical studies analysing the effect of treatments in 
Chlamydia pneumoniae infections are limited in number. Unlike penicillin, ampicillin or the 
sulphamides, erythromycin, tetracycline or doxycycline show an antibiotic activity in vitro against 
10 Chlamydia pneumoniae. However, a treatment at high doses should be continued for several weeks in 
order to avoid a recurrence of the infection. Accordingly, the use of two new macrolides, 
clarithromycin and azithromycin, whose diffusion, bioavailability and half-life allow shorter and better 
tolerated cures, is nowadays preferred. In the absence of definitive proof based on the results of 
clinical studies, an effective, without recurrences, and well-tolerated treatment of Chlamydia 
1 5 pneumoniae infections therefore remains desirable. 

An even more important need up until now relates to a specific and sensitive diagnosis, 
which can be carried out conveniently and rapidly, allowing early screening for the infection. Methods 
based on Chlamydia pneumoniae culture are slow and require a considerable know-how because of the 
difficulty involved in the collection, preservation and storage of the strain under appropriate 
20 conditions. Methods based on antigen detection (EIA, DFA) or on nucleic acid amplification (PCR) 
provide tests which are more suitable for laboratory practice. A reliable, sensitive and convenient test, 
which allows distinction between serogroups and a fortiori between Chlamydia pneumoniae species is 

therefore highly desirable. 

This is all the more important since the symptoms of Chlamydia pneumoniae infection 
25 appear slowly, since all the pathologies associated with these infections have not yet been identified, 
and since, as has been mentioned above, an association is suspected between these infections and 
serious chronic infections, asthma or atherosclerosis. 

No vaccine is yet available against Chlamydia pneumoniae: this is due to the labile nature 
of the antigens specific to the strain, which has so far prevented their specific identification. 
30 Although the number of studies and of animal models developed is high, the antigens 

used have not induced sufficient protective immunity to lead to the development of human vaccines. 
In the case of Chlamydia pneumoniae, the role of the immune defense in the physiology and pathology 
of the disease should probably be understood in order to develop satisfactory vaccines. 

More detailed information relating to the biology of these strains, their interactions with 
3 5 their hosts, the associated phenomena of infectivity and those of escaping the immune defenses of the 
host in particular, and finally their involvement in the development of the these associated pathologies, 
will allow a better understanding of these mechanisms. In the light of the preceding text which shows 
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in particular the limitations of the means of controlling Chlamydia pneumoniae infection, it is 
therefore at present essential, on the one hand, to develop molecular tools, in particular from a better 
genetic knowledge of Chlamydia pneumoniae y but also to develop new preventive and therapeutic 
treatments, new diagnostic methods and new vaccine strategies which are specific, effective and 
5 tolerated. This is precisely the object of the present invention. 

The subject of the present invention is the nucleotide sequence having the sequence 
SEQ ID No. 1 of the Chlamydia pneumoniae genome. However, the invention is not limited to SEQ 
ID No. 1, but encompasses genomes and nucleotides encoding polypeptides of strain variants, 
polymorphisms, allelic variants, and mutants. 
10 Thus, the subject of the present invention encompasses nucleotide sequences 

characterized in that they are chosen from: 

a) the nucleotide sequence of SEQ ID No. 1, a nucleotide sequence exhibiting at least 
99.9% identity with the sequence SEQ ID No. 1, the nucleotide sequence of the genomic 
DNA contained within ATCC Deposit No. , the nucleotide sequence of a clone insert 

15 within ATCC Deposit No. _; 

b) a nucleotide sequence homologous to the sequence SEQ ID No. 1; 

c) a polynucleotide sequence that hybridizes to the nucleotide sequence of a) under 
conditions of high or intermediate stringency as described below: 

(i) By way of example and not limitation, procedures using conditions of high stringency are 

20 as follows: Prehybridization of filters containing DNA is carried out for 8 h to overnight at 65EC in 
buffer composed of 6X SSC, 50 mM Tris-HCl (pH 7.5), 1 mM EDTA, 0.02% PVP, 0.02% Ficoll, 
0.02% BSA, and 500 jug/ml denatured salmon sperm DNA. Filters are hybridized for 48 h at 65EC, 
the preferred hybridization temperature, in prehybridization mixture containing 100 jig/ml denatured 
salmon sperm DNA and 5-20 X 10 6 cpm of 32 P-labeled probe. Alternatively, the hybridization step 

25 can be performed at 65EC in the presence of SSC buffer, 1 x SSC corresponding to 0.15M NaCl and 
0.05 M Na citrate. Subsequently, filter washes can be done at 37EC for 1 h in a solution containing 
2X SSC, 0.01% PVP, 0.01% Ficoll, and 0.01% BSA, followed by a wash in 0.1X SSC at 50EC for 45 
min. Alternatively, filter washes can be performed in a solution containing 2 x SSC and 0.1% SDS, or 
0.5 x SSC and 0.1% SDS, or 0.1 x SSC and 0.1% SDS at 68EC for 15 minute intervals. Following 

3 0 the wash steps, the hybridized probes are detectable by autoradiography. Other conditions of high 
stringency which may be used are well known in the art and as cited in Sambrook et ah, 1989, 
Molecular Cloning, A Laboratory Manual, Second Edition, Cold Spring Harbor Press, N.Y., pp. 9.47- 
9.57; and Ausubel et ah, 1989, Current Protocols in Molecular Biology, Green Publishing Associates 
and Wiley Interscience, N.Y. are incorporated herein in their entirety. 

35 (ii) By way of example and not limitation, procedures using conditions of intermediate 

stringency are as follows: Filters containing DNA are prehybridized, and then hybridized at a 
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temperature of 60EC in the presence of a 5 x SSC buffer and labeled probe. Subsequently, filters 
washes are performed in a solution containing 2x SSC at 50EC and the hybridized probes are 
detectable by autoradiography. Other conditions of intermediate stringency which may be used are 
well known in the art and as cited in Sambrook et al., 1989, Molecular Cloning, A Laboratory Manual, 
5 Second Edition, Cold Spring Harbor Press, N.Y., pp. 9.47-9.57; and Ausubel et al., 1989, Current 
Protocols in Molecular Biology, Green Publishing Associates and Wiley Interscience, N.Y. are 

incorporated herein in their entirety. 

d) a nucleotide sequence complementary to the sequence SEQIDNo.l or 
complementary to a nucleotide sequence as defined in a), b) or c) and a nucleotide 

10 sequence of their corresponding RN A; 

e) a nucleotide sequence of a representative fragment of the sequence SEQ ID No. 1 , or of 
a representative fragment of the nucleotide sequence as defined in a), b), c) or d); 

f) a nucleotide sequence comprising a sequence as defined in a), b), c), d) or e); 

g) a nucleotide sequence capable of being obtained from a nucleotide sequence as defined 
15 ina),b),c),d),e)orf);and 

h) a modified nucleotide sequence of a nucleotide sequence as defined in a), b), c), d), e), 

f) or g). 

Nucleotide sequence, polynucleotide or nucleic acid are understood to mean, according to 
the present invention, either a double-stranded DNA, a single-stranded DNA or products of 

2 0 transcription of the said DNAs. 

It should be understood that the present invention does not relate to the genomic 
nucleotide sequences of Chlamydia pneumoniae taken in their natural environment, that is to say in the 
natural state. They are sequences which may have been isolated, purified or partially purified, by 
separation methods such as, for example, ion-exchange chromatography, molecular size exclusion 
25 chromatography or affinity chromatography, or alternatively fractionation techniques based on 
solubility in various solvents, or by genetic engineering methods such as amplification, cloning or 
subcloning, it being possible for the sequences of the invention to be carried by vectors. 

The nucleotide sequence SEQ ID No. 1 was obtained by sequencing the Chlamydia 
pneumoniae genome by the method of directed sequencing after fluorescent automated sequencing of 
3 0 the inserts of clones and assembling of these sequences of nucleotide fragments (inserts) by means of 
softwares (cf. Examples). In spite of the high precision of the sequence SEQ ID No. 1, it is possible 
that it does not perfectly, 100% represent the nucleotide sequence of the Chlamydia pneumoniae 
genome and that a few rare sequencing errors or uncertainties still remain in the sequence 
SEQ ID No. 1. In the present invention, the presence of an uncertainty for an amino acid is designated 
35 by "Xaa" and that for a nucleotide is designated by "N" in the sequence listing below. These few rare 
errors or uncertainties could be easily detected and corrected by persons skilled in the art using the 
entire chromosome and/or its representative fragments according to the invention and standard 
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amplification, cloning and sequencing methods, it being possible for the sequences obtained to be 
easily compared, in particular by means of a computer software and using computer-readable media 
for recording the sequences according to the invention as described, for example, below. After 
correcting these possible rare errors or uncertainties, the corrected nucleotide sequence obtained would 
5 still exhibit at least 99.9% identity with the sequence SEQ ID No. 1. Such rare sequencing 

uncertainties are not present within the DNA contained within ATCC Deposit No. or , and 

whatever rare sequence uncertainties that exist within SEQ ID No. 1 can routinely be corrected 
utilizing the DNA of the ATCC deposits. 

Homologous nucleotide sequence for the purposes of the present invention is understood 

10 to mean a nucleotide sequence having a percentage identity with the bases of the nucleotide sequence 
SEQ ID No. 1 of at least 80%, preferably 90% and 95%, this percentage being purely statistical and it 
being possible for the differences between the two nucleotide sequences to be distributed randomly 
and over their entire length. The said homologous sequences exhibiting a percentage identity with the 
bases of the nucleotide sequence SEQ ED No. 1 of at least 80%, preferably 90% and 95%, may 

15 comprise, for example, the sequences corresponding to the genomic sequence or to the sequences of 
its representative fragments of a bacterium belonging to the Chlamydia family, including the species 
Chlamydia trachomatis, Chlamydia psittaci and Chlamydia pecorum mentioned above, as well as the 
sequences corresponding to the genomic sequence or to the sequences of its representative fragments 
of a bacterium belonging to the variants of the species Chlamydia pneumoniae. In the present 

20 invention, the terms family and genus are mutually interchangeable, the terms variant, serotype, strain 
and subspecies are also mutually interchangeable. These homologous sequences may thus correspond 
to variations linked to mutations within the same species or between species and may correspond in 
particular to truncations, substitutions, deletions and/or additions of at least one nucleotide. The said 
homologous sequences may also correspond to variations linked to the degeneracy of the genetic code 

25 or to a bias in the genetic code which is specific to the family, to the species or to the variant and 
which are likely to be present in Chlamydia. 

Protein and/or nucleic acid sequence homologies may be evaluated using any of the 
variety of sequence comparison algorithms and programs known in the art. Such algorithms and 
programs include, but are by no means limited to, TBLASTN, BLASTP, FASTA, TFASTA, and 

30 CLUSTALW (Pearson and Lipman, 1988, Proc. Natl. Acad. Sci. USA 85(8J:2444-2448; Altschul et 
al. y 1990, J. Mol. Biol. 275^:403-410; Thompson et al. y 1994, Nucleic Acids Res. 22( 2 ):4673-4680; 
Higgins et al, 1996, Methods Enzymol. 266:383-402; Altschul et al, 1990, J. Mol Biol. 275(3j:403- 
410; Altschul et ai 7 1993, Nature Genetics 5:266-272). 

In a particularly preferred embodiment, protein and nucleic acid sequence homologies are 

35 evaluated using the Basic Local Alignment Search Tool ("BLAST") which is well known in the art 
(see, e.g., Karlin and Altschul, 1990, Proc. Natl. Acad. Sci USA 87:2267-2268; Altschul et at, 1990, 
J. Mol Biol 275:403-410; Altschul et al, 1993, Nature Genetics 3:266-272; Altschul et al, 1997, 
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M* AcMs to 25.3389-3402). In panic.,., five specie BLAST programs to, used . perform » 

""""""^mBLASTP and BLASTS Compaq an -no acid W — ' 

sequence database; 

(2) BLASTN compares a nucleotide query sequence against a nucieotide sequence 

database; 

(3) BLASTX compares the six-frame concept elation prod*, of a quay 

nucleotide sequence (bo* snoods) against a protein sequence database; 
WPBLASTN contparea a qner, pmtein sequence against a nucleotide sequence database 

translated in all six reading (rattles (ball strands); and 
(5 )TBLASTX compares the six-tone translations of a nucleutide query sequence against 
the six-frame translations o( a nucleotide sequence database. 
T*c BLAST programs identify homologous sequences by idoniifying similar segments which tun 

* ' amioo o, „„ c | e ic acid sequence 

referred to herein as "high-scoring segment pans, between query 
15 and . test soqnence which is ptnferabl, ohrained from n protein or nucleic add s«,u«nc. datab^ 

i» *» -. - — — - " ih ; r ~ Ts 

a „, ,992, Science 2561.443 .445; Hcnihoff and Henihoff, 1993, M. I WM >. Loss 
rafnrahly, the PAM or PAU2S0 matrices may a,s„ be user, (see, ,,, Schwann and Dayh* «* ; 
20 Mnrrfces for Coring I*— M» of A— — - - — 

Waahington: National Biomedical Research Foundntton) 

The BLAST programa oyalnat. the sunisucol signiftoanc. of all high-soonng segmen 
paiB MM and preferably sel« rhoae segments which satisfy a user-specified th^hold o 
Uicance, such aa n nser-spocifteri petcon, hon»,ogy. Praferably, the — ^ - • 
2 5 hi^-scoring segment pair is ornate, usmg die smWcal signitancc fonnnla of Karlm (see. e.g., 
Karlin and Altschul, 1990, Proc. Natl- Acad. Set. USA S7;2267-2268). 

Nucleotide snqnencc complement to a sequence of the inyention ts und.ra.ood to 
— any DNA whose nucleotides are complementary to .hot, of the seqnnnon o, the invnntton, and 
whose orientation is reveraed (antiparallel sequence). 

Tho prasent invention hard,., compriana fragments o, ,h. sequences of a, through 0, 
nbove. Representative tracts of die fences according to the invention will be nndorrfoc, to 
„. any nncleurido fra^m having - 1- • — - - — *» » ~° 

Iccessivl nuclootidos. - sail, more preferebly a, le*. ,5 or leaa, 20 successive nue,„hd of *e 
sequence from which it is derived. is und.ratood thnt such frogmen, rnfo, only to pomona of SEQ 
35 nyNo.lma.amno.ounentlylla.rfinapublioly.vailabkda.abas.. 

Among .Itcso mpmsenlative fragment those cnpable of hybndramg under smugem 
condidons with a nucleotide sequence according ,» .ho invention nra prerereC. Hybridiaation under 
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stringent conditions means that the temperature and ionic strength conditions are chosen such that 
they allow hybridization to be maintained between two complementary DNA fragments. 

By way of illustration, high stringency conditions for the hybridization step for the 
purposes of defining the nucleotide fragments described above, are advantageously the following. 
5 The hybridization is carried out at a preferred temperature of 65EC in the presence of 

SSC buffer, 1 x SSC corresponding to 0.15 M NaCl and 0.05 M Na citrate. The washing steps may 
be, for example, the following: 

2xSSC, 0.1% SDS at room temperature followed by three washes with 1 x SSC, 0.1% SDS; 
0.5 x SSC, 0.1% SDS; 0.1 x SSC, 0.1% SDS at 68EC for 15 minutes. 

10 Intermediate stringency conditions, using, for example, a temperature of 60EC in the 

presence of a 5 x SSC buffer, or of low stringency, for example a temperature of 50EC in the presence 
of a 5 x SSC buffer, respectively require a lower overall complementarity for the hybridization 
between the two sequences. 

The stringent hybridization conditions described above for a polynucleotide of about 

15 300 bases in size will be adapted by persons skilled in the art for larger- or smaller-sized 
oligonucleotides, according to the teaching of Sambrook et al., 1989. 

Among the representative fragments according to the invention, those which can be used 
as primer or probe in methods which make it possible to obtain homologous sequences or their 
representative fragments according to the invention, or to reconstitute a genomic fragment found to be 

20 incomplete in the sequence SEQIDNo. 1 or carrying an error or an uncertainty, are also preferred, 
these methods, such as the polymerase chain reaction (PCR), cloning and sequencing of nucleic acid 
being well known to persons skilled in the art. These homologous nucleotide sequences 
corresponding to mutations or to inter- or intra-species variations, as well as the complete genomic 
sequence or one of its representative fragments capable of being reconstituted, of course form part of 

25 the invention. 

Among the said representative fragments, those which can be used as primer or probe in 
methods allowing diagnosis of the presence of Chlamydia pneumoniae or one of its associated 
microorganisms as defined below are also preferred. 

The representative fragments capable of modulating, regulating, inhibiting or inducing 
3 0 the expression of a gene of Chlamydia pneumoniae or one of its associated microorganisms, and/or 
capable of modulating the replication cycle of Chlamydia pneumoniae or one of its associated 
microorganisms in the host cell and/or organism, are also preferred. Replication cycle is intended to 
designate invasion, multiplication, intracellular localization, in particular retention in the vacuole and 
inhibition of the process of fusion to the lysosome, and propagation of Chlamydia pneumoniae or one 
35 of its associated microorganisms from host cells to host cells. 

Among the said representative fragments, those corresponding to nucleotide sequences 
corresponding to open reading frames, called ORF sequences (ORF for open reading frame), and 



10 



PCT/IB98/01890 

WO 99/27105 

10 

encoding polypeptides, sueh as for example, bu, without being limited thereto, the ORF sequences 
which will be later described, are finally preferred. 

The representative fragments according to the invention may be obtained, for example, 
by specific amplification, such as PGR, or after digestion, with appropriate restriction enzymes, of 
5 nucleotide sequences according to the invention; these methods are in particular descnbed m the 
m anual by Sambrook et al., 1989. The said representative fragments may also be obtamed by 
chemical synthesis when they are not too large in size and according to methods well known to 
persons skilled in the art. For example, such fragments can be obtained by isolating fragments of the 
genomic DNA of ATCC Deposit No. _ or a clone insert present at this ATCC Depos.t No. _. 

The representative fragments according to the invention may be used, for example, as 
primer to reconstitute some of the said representative fragments, in particular those in which a portton 
of the sequence is likely to be missing or imperfect, by methods well known to persons skilled ,n the 
art such as amplification, cloning or sequencing techniques. 

Modified nucleotide sequence will be understood to mean any nucleotide sequence 
15 obtained by mutagenesis according to techniques well known to persons skilled in the art, and 
exhibiting modifications in relation to the normal sequences, for example mutations in the regulatory 
and/or promoter sequences for the expression of a polypeptide, in particular leading to a modificatton 
of the level of expression of the said polypeptide or to a modulation of the replicati ve cycle. 

Modified nucleotide sequence will also be understood to mean any n„cleot,de sequence 
20 encoding a modified polypeptide as defined below. 

The subject of the present invention also includes Chlamydia pneumoniae nucleot.de 
sequences characterized in that they are chosen from a nucleotide sequence of an open reading frame 
(ORF), that is, the ORF2 to ORF1297 sequences. 

The ORF2 to ORF1297 nucleotide sequences are defined in Tables 1 and 2, infra, by 
25 their position on the sequence SEQ ID No. 1. For example, the ORF2 sequence is defined by the 
nucleotide sequence between the nucleotides at position 42 and 794 on the sequence SEQ ID No. 1, 
ends included ORF2 to ORF1297 have been identified via homology analyses as well as v.a analyses 
of potential ORF start sites, as discussed in the examples be.ow. It is to be understood that each 
identified ORF of the invention comprises a nucleotide sequence that spans the contiguous nucleot.de 
30 sequence from the ORF stop codon immediately 3' to the stop codon of the preceding ORF and 
through the 5' codon to the next stop codon of SEQ ID No,l in-frame to the ORF nucleot.de 
sequence. Tab.e 2, infra, lists the beginning, end and potential start site of each of ORFs 1-1297. In 
one embodiment, the ORF comprises the contiguous nuc.eotide sequence spanning from the potennal 
ORF start site downstream (that is, 3') to the ORF stop codon (or the ORF codon immediately adjacent 
35 to and upstream of the ORF stop codon). ORF2 to ORF1 297 encode the polypeptides of SEQ ©No. 
2 to SEQ ID No. 1291 and of SEQ ID No. 6844 to SEQ ID No. 6849, respectively. 

Upon introduction of minor frameshifts, certain individual ORFs can compnse larger 
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"combined" ORFs. A list of such putative "combined" ORFs is shown in Table 3, below. 
For example, a combined ORF can comprise ORF 25, ORF 26 and ORF 27, including intervening in- 
frame, nucleotide sequences. The order of ORFs (5' to 3'), within each "combined" ORF is as listed. 
It is to be understood that when ORF2 to ORF1297 are referred to herein, such reference is also meant 
5 to include "combined" ORFs. Polypeptide sequences encoded by such "combined" ORFs are also part 
of the present invention. 

Table 3 

ORF 25, ORF 26, ORF 27; 
10 ORF 28, ORF 29, ORF 30; 

ORF 31, ORF 32; 

ORF 33, ORF 35; 

ORF 466, ORF 467; 

ORF 468, ORF 469; 
15 ORF 477, ORF 476, ORF 474; 

ORF 480, ORF 482; 

ORF 483, ORF 485, ORF 486, ORF 500; 
ORF 503, ORF 504, ORF 505; 
ORF 506, ORF 507; 
20 ORF 1211, ORF 647; 
ORF 1286, ORF 1039; 
ORF 691, ORF 690; 
ORF 105, ORF 106; 

ORF 170, ORF 171;ORF 394, ORF 393; 
25 ORF 453, ORF 452, ORF 451; 

ORF 526, ORF 525; 

ORF 757, ORF 756, ORF 755; 

ORF 856, ORF 855; 

ORF 958, ORF 957; 
30 ORF 915, ORF 914, ORF 913; 

ORF 543, ORF 544; 

ORF 1266, ORF 380; 

ORF 745, ORF 744; 

ORF 777, ORF 776; 
3 5 ORF 343, ORF 1297, and representative fragments. 



Table 1 also depicts the results of homology searches that compared the sequences of the 
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polypeptides encoded by each of the ORFs to sequences present in public published databases. 
It is understood that those polypeptides listed in Table 1 as exhibiting greater than about 95% identity 
to a polypeptide present in a publicly disclosed database are not considered part of the present 
invention; likewise in this embodiment, those nucleotide sequences encoding such polypeptides are 
5 not considered part of the invention. In another embodiment, it is understood that those polypeptide, 
listed in Table 1 as exhibiting greater than about 99% identity to a polypeptide present in a pubhcly 
disclosed database are not considered part of the invention; likewise, in this embodiment, those 
nucleotide sequences encoding such polypeptides are not considered part of the invention. 

The invention also relates to the nucleotide sequences characterized in that they compnse 

10 a nucleotide sequence chosen from: 

a) an ORF2 to ORF1297, a "combined" ORF nucleotide sequence, the nucleotide sequence of the 

genomic DNA contained within ATCC Deposit No or the nucleotide sequence of a clone 

insert in ATCC Deposit No according to the invention; 

b) a homologous nucleotide sequence exhibiting at least 80% identity across an entire ORF2 to 
1 5 ORF1297 nucleotide sequence according to the invention or as defined in a); 

c) a polynucleotide sequence that hybridizes to ORF2 to ORF1297 under conditions of high or 
intermediate stringency as described below; 

(i) By way of example and not limitation, procedures using conditions of high stringency are as 
follows- Prehybridization of filters containing DNA is carried out for 8 h to overnight at 65EC in 
20 buffer composed of 6X SSC, 50 mM Tris-HCl ( P H 7.5), 1 mM EDTA, 0.02% PVP, 0.02% Ficoll, 
0.02% BSA, and 500 ug/ml denatured salmon sperm DNA. Filters are hybridized for 48 h at 65EC, 
the preferred hybridization temperature, in prehybridization mixture containing 100 ug/ml denatured 
salmon sperm DNA and 5-20 X 10 6 epm of ^-labeled probe. Alternatively, the hybridization step 
can be performed at 65EC in the presence of SSC buffer, 1 x SSC corresponding to 0.15M NaCl and 
25 0.05 M Na citrate. Subsequently, filter washes can be done at 37EC for 1 h in a solution containing 
2X SSC, 0.01% PVP, 0.01% Ficoll, and 0.01% BSA, followed by a wash in 0.1X SSC at 50EC for 45 
nun. Alternatively, filter washes can be performed in a solution containing 2 x SSC and 0.1% SDS, or 
0 5 x SSC and 0.1% SDS, or 0.1 x SSC and 0.1% SDS at 68EC for 15 minute intervals. Following 
the wash steps, the hybridized probes are detectable by autoradiography. Other conditions of high 
30 stringency which may be used are well known in the art and as cited in Sambrook et at, 1989, 
Molecular Cloning, A Laboratory Manual, Second Edition, Cold Spring Harbor Press, N.Y., pp. 
9 47-9.57; and Ausubel et al., 1989, Current Protocols in Molecular Biology, Green Publishing 
Associates and Wiley Interscience, N.Y. are incorporated herein in their entirety. Preferably, such 
sequences encode a homolog of a polypeptide encoded by one of ORF2 to ORF1297. In one 
3 5 embodiment, such sequences encode a Chlamydia pneumoniae polypeptide. 

(ii) By way of example and not limitation, procedures using conditions of intermediate 



WO 99/27105 PCT/1B98/01890 

13 

stringency are as follows: Filters containing DNA are prehybridized, and then hybridized at a 
temperature of 60EC in the presence of a 5 x SSC buffer and labeled probe. Subsequently, filters 
washes are performed in a solution containing 2x SSC at 50EC and the hybridized probes are 
detectable by autoradiography. Other conditions of intermediate stringency which may be used are 
5 well known in the art and as cited in Sambrook et al., 1989, Molecular Cloning, A Laboratory Manual, 
Second Edition, Cold Spring Harbor Press, N.Y., pp. 9.47-9.57; and Ausubel et al., 1989, Current 
Protocols in Molecular Biology, Green Publishing Associates and Wiley Interscience, N.Y. are 
incorporated herein in their entirety. Preferably, such sequences encode a homolog of a polypeptide 
encoded by one of ORF2 to ORF1297. In one embodiment, such sequences encode a Chlamydia 
1 0 pneumoniae polypeptide. 

d) complementary or RNA nucleotide sequence corresponding to an ORF2 to ORF1297 sequence 
according to the invention or as defined in a), b) or c); 

e) a nucleotide sequence of a representative fragment of an ORF2 to ORF1297 sequence according 
to the invention or of a sequence as defined in a), b), c) or d); 

15 0 a nucleotide sequence capable of being obtained from an ORF2 to ORF1297 sequence according 
to the invention or as defined in a), b), c), d) or e); and 

g) a modified nucleotide sequence of an ORF2 to ORF1297 sequence according to the invention or 
as defined in a), b), c), d), e) or f); 

As regards the homology with the ORF2 to ORF1297 nucleotide sequences, the 

20 homologous sequences exhibiting a percentage identity with the bases of one of the ORF2 to 
ORF1297 nucleotide sequences of at least 80%, preferably 90% and 95%, are preferred. Such 
homologous sequences are identified routinely via, for example, the algorithms described above and in 
the examples below. The said homologous sequences correspond to the homologous sequences as 
defined above and may comprise, for example, the sequences corresponding to the ORF sequences of 

25 a bacterium belonging to the Chlamydia family, including the species Chlamydia trachomatis, 
Chlamydia psittaci and Chlamydia pecorum mentioned above, as well as the sequences corresponding 
to the ORF sequences of a bacterium belonging to the variants of the species Chlamydia pneumoniae. 
These homologous sequences may likewise correspond to variations linked to mutations within the 
same species or between species and may correspond in particular to truncations, substitutions, 

30 deletions and/or additions of at least one nucleotide. The said homologous sequences may also 
correspond to variations linked to the degeneracy of the genetic code or to a bias in the genetic code 
which is specific to the family, to the species or to the variant and which are likely to be present in 
Chlamydia. 

The invention comprises polypeptides encoded by a nucleotide sequence according to the 
3 5 invention, preferably by a representative fragment of the sequence SEQ ID No, 1 and corresponding to 
an ORF sequence, in particular the Chlamydia pneumoniae polypeptides, characterized in that they are 
chosen from the sequences SEQ ID No. 2 to SEQ ID No. 1291 or SEQ ID No. 6844 to SEQ ID No. 
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6849 and representative fragments thereof. However, the invention is not limited to 
polypeptides encoded by ORFs in SEQ ID No. 1 and its corresponding ORF sequences, but 
encompasses polypeptides of strain variants, polymorphisms, allelic variants, and mutants. 

Thus, the invention also comprises the polypeptides characterized in that they comprise a 

5 polypeptide chosen from: 

a) a polypeptide encoded by a polynucleotide sequence in SEQ ID No. 1 (e.g., any polypeptide 
encoded by a polynucleotide sequence corresponding to ORF2 to ORF1297 and/or representative 
fragments thereof) according to the invention; 

b) a polypeptide homologous to a polypeptide according to the invention, or as defined in a); 

10 c) a polypeptide encoded by a polynucleotide sequence that hybridizes to SEQ ID No. 1 or ORF2 to 
ORF1297 under high or intermediate stringency as described below: 

(i) By way of example and not limitation, procedures using conditions of high stringency are as 
follows: Prehybridization of filters containing DNA is carried out for 8 h to overnight at 65EC in 
buffer composed of 6X SSC, 50 mM Tris-HCl (pH7.5), 1 mM EDTA, 0.02% PVP, 0.02% Ficoll, 
15 0.02% BSA, and 500 ug/ml denatured salmon sperm DNA. Filters are hybridized for 48 h at 65EC, 
the preferred hybridization temperature, in prehybridization mixture containing 100 ug/ml denatured 
salmon sperm DNA and 5-20 X 10 6 cpm of "P-labeled probe. Alternatively, the hybridization step 
can be performed at 65EC in the presence of SSC buffer, 1 x SSC corresponding to 0.15M NaCl and 
0.05 M Na citrate. Subsequently, filter washes can be done at 37EC for 1 h in a solution containing 
20 2X SSC, 0.01% PVP, 0.01% Ficoll, and 0.01% BSA, followed by a wash in 0.1X SSC at 50EC for 45 
min. Alternatively, filter washes can be performed in a solution containing 2 x SSC and 0.1% SDS, or 
0.5 x SSC and 0.1% SDS, or 0.1 x SSC and 0.1% SDS at 68EC for 15 minute intervals. Following 
the wash steps, the hybridized probes are detectable by autoradiography. Other conditions of high 
stringency which may be used are well known in the art and as cited in Sambrook et al., 1989, 
25 Molecular Cloning, A Laboratory Manual, Second Edition, Cold Spring Harbor Press, N.Y., pp. 9.47- 
9.57; and Ausubel et al., 1989, Current Protocols in Molecular Biology, Green Publishing Associates 
and Wiley Interscience, N.Y. are incorporated herein in their entirety. Preferably such polypeptide 
represents a homolog of a polypeptide encoded by ORF2 to ORF1297. Preferably, such sequences 
encode a homolog of a polypeptide encoded by one of ORF2 to ORF1297. In one embodiment, such 
3 0 sequences encode a Chlamydia pneumoniae polypeptide. 

(ii) By way of example and not limitation, procedures using conditions of intermediate 
stringency are as follows: Filters containing DNA are prehybridized, and then hybridized at a 
temperature of 60EC in the presence of a 5 x SSC buffer and labeled probe. Subsequently, filters 
washes are performed in a solution containing 2x SSC at 50EC and the hybridized probes are 
35 detectable by autoradiography. Other conditions of intermediate stringency which may be used are 
well known in the art and as cited in Sambrook et al., 1989, Molecular Cloning, A Laboratory Manual, 
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Second Edition, Cold Spring Harbor Press, N.Y., pp. 9.47-9.57; and Ausubel et al., 1989, Current 
Protocols in Molecular Biology, Green Publishing Associates and Wiley Interscience, N.Y. are 
incorporated herein in their entirety. Preferably, such sequences encode a homolog of a polypeptide 
encoded by one of ORF2 to ORF1297. In one embodiment, such sequences encode a Chlamydia 
5 pneumoniae polypeptide. 

d) a fragment of at least 5 amino acids of a polypeptide according to the invention, or as defined in 
a), b) or c); 

e) a biologically active fragment of a polypeptide according to the invention, or as defined in a), b), 
c) or d); and 

10 0 a modified polypeptide of a polypeptide according to the invention, as defined in a), b), c),d) or 
e). 

In the present description, the terms polypeptide, peptide and protein are interchangeable. 
It should be understood that the invention does not relate to the polypeptides in natural 
form, that is to say that they are not taken in their natural environment but that they may have been 

15 isolated or obtained by purification from natural sources, or alternatively obtained by genetic 
recombination, or else by chemical synthesis and that they may, in this case, comprise nonnatural 
amino acids, as will be described below. 

Homologous polypeptide will be understood to designate the polypeptides exhibiting, in 
relation to the natural polypeptide, certain modifications such as in particular a deletion, addition or 

2 0 substitution of at least one amino acid, a truncation, an extension, a chimeric fusion, and/or a mutation, 
or polypeptides exhibiting post-translational modifications. Among the homologous polypeptides, 
those whose amino acid sequence exhibits at least 80%, preferably 90%, homology or identity with the 
amino acid sequences of the polypeptides according to the invention are preferred. In the case of a 
substitution, one or more consecutive or nonconsecutive amino acids are replaced by "equivalent" 

25 amino acids. The expression "equivalent" amino acid is intended here to designate any amino acid 
capable of being substituted for one of the amino acids in the basic structure without, however, 
essentially modifying the biological activities of the corresponding peptides and as will be defined 
later. 

Protein and/or nucleic acid sequence homologies may be evaluated using any of the 
30 variety of sequence comparison algorithms and programs known in the art. Such algorithms and 
programs include, but are by no means limited to, TBLASTN, BLASTP, FASTA, TFASTA, and 
CLUSTALW (Pearson and Lipman, 1988, Proc. Natl. Acad. Sci. USA 85(8): 2444-244 8; Altschul et 
al, 1990, J. Mol Biol 275^:403-410; Thompson et al, 1994, Nucleic Acids Res. 22(2^:4673-4680; 
Higgins et al, 1996, Methods Enzymol. 266:383-402; Altschul et al, 1990, J. Mol Biol 2J5(3):403- 
35 410; Altschul et al, 1993, Nature Genetics 3:266-272). 

In a particularly preferred embodiment, protein and nucleic acid sequence homologies are 
evaluated using the Basic Local Alignment Search Tool ("BLAST") which is well know in the art (see, 
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e.g., Karlin and Altschul, 1990, Proc. Natl. Acad. Sci. USA 87:2267-2268; Altschul et at., 1990, J. 
Mol Biol. 2/5:403-410; Altschul et al, 1993, Nature Genetics 3:266-272; Altschul et al, 1997, Nuc. 
Acids Res. 25:3389-3402). In particular, five specific BLAST programs are used to perform the 
following task: 

5 (l)BLASTP and BLAST3 compare an amino acid query sequence against a protein 

sequence database; 

(2) BLASTN compares a nucleotide query sequence against a nucleotide sequence 

database; 

(3) BLASTX compares the six-frame conceptual translation products of a query 
x 0 nucleotide sequence (both strands) against a protein sequence database; 

(4) TBLASTN compares a query protein sequence against a nucleotide sequence database 

translated in all six reading frames (both strands); and 

(5) TBLASTX compares the six-frame translations of a nucleotide query sequence against 

the six-frame translations of a nucleotide sequence database. 
15 The BLAST programs identify homologous sequences by identifying similar segments, which are 
referred to herein as "high-scoring segment pairs," between a query amino or nucleic acid sequence 
and a test sequence which is preferably obtained from a protein or nucleic acid sequence database. 
High-scoring segment pairs are preferably identified {i.e., aligned) by means of a scoring matrix, many 
of which are known in the art. Preferably, the scoring matrix used is the BLOSUM62 matrix (Gonnet 
20 et al., 1992, Science 256:1443-1445; Henikoff and Henikoff, 1993, Proteins 77:49-61). Less 
preferably, the PAM or PAM250 matrices may also be used (see, e.g., Schwartz and Dayhoff, eds., 
1978, Matrices for Detecting Distance Relationships: Atlas of Protein Sequence and Structure, 
Washington: National Biomedical Research Foundation) 

The BLAST programs evaluate the statistical significance of all high-scoring segment 
25 pairs identified, and preferably selects those segments which satisfy a user-specified threshold of 
significance, such as a user-specified percent homology. Preferably, the statistical significance of a 
high-scoring segment pair is evaluated using the statistical significance formula of Karlin (see, e.g., 
Karlin and Altschul, 1990, Proc. Natl. Acad. Sci. USA 87:2267-2268). 

Equivalent amino acids may be determined either based on their structural homology 
30 with the amino acids for which they are substituted, or on results of comparative tests of biological 
activity between the various polypeptides which may be carried out. 

By way of example, there may be mentioned the possibilities of substitutions which may 
be carried out without resulting in a substantial modification of the biological activity of the 
corresponding modified polypeptides; the replacements, for example, of leucine with valine or 
3 5 isoleucine, of aspartic acid with glutamic acid, of glutamine with asparagine, of arginine with lysine, 
and the like, the reverse substitutions naturally being feasible under the same conditions. 

The homologous polypeptides also correspond to the polypeptides encoded by the 
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homologous nucleotide sequences as defined above and thus comprise in the present definition 
the mutated polypeptides or polypeptides corresponding to inter- or intra-species variations which may 
exist in Chlamydia, and which correspond in particular to truncations, substitutions, deletions and/or 
additions of at least one amino acid residue. 
5 Biologically active fragment of a polypeptide according to the invention will be 

understood to designate in particular a polypeptide fragment, as defined below, exhibiting at least one 
of the characteristics of the polypeptides according to the invention, in particular in that it is: 
capable of eliciting an immune response directed against Chlamydia pneumoniae', and/or 
capable of being recognized by an antibody specific for a polypeptide according to the invention; 
10 and/or 

capable of binding to a polypeptide or to a nucleotide sequence of Chlamydia pneumoniae; and/or 
capable of modulating, regulating, inducing or inhibiting the expression of a gene of Chlamydia 
pneumoniae or one of its associated microorganisms, and/or capable of modulating the 
replication cycle of Chlamydia pneumoniae or one of its associated microorganisms in the 
15 host cell and/or organism; and/or 

capable of generally exerting an even partial physiological activity, such as for example a 
structural activity (cellular envelope, ribosome), an enzymatic (metabolic) activity, a transport 
activity, an activity in the secretion or in the virulence. 

A polypeptide fragment according to the invention is understood to designate a 
20 polypeptide comprising a minimum of 5 amino acids, preferably 10 amino acids or preferably 
15 amino acids. It is to be understood that such fragments refer only to portions of polypeptides 
encoded by ORF2 to ORF1297 that are not currently listed in a publicly available database. 

The polypeptide fragments according to the invention may correspond to isolated or 
purified fragments which are naturally present in Chlamydia pneumoniae or which are secreted by 
25 Chlamydia pneumoniae, or may correspond to fragments capable of being obtained by cleaving the 
said polypeptide with a proteolytic enzyme, such as trypsin or chymotrypsin or collagenase, or with a 
chemical reagent, such as cyanogen bromide (CNBr) or alternatively by placing the said polypeptide 
in a highly acidic environment, for example at pH 2.5. Such polypeptide fragments may be equally 
well prepared by chemical synthesis, using hosts transformed with an expression vector according to 
30 the invention containing a nucleic acid allowing the expression of the said fragments, placed under the 
control of appropriate elements for regulation and/or expression. 

"Modified polypeptide" of a polypeptide according to the invention is understood to 
designate a polypeptide obtained by genetic recombination or by chemical synthesis as will be 
described below, exhibiting at least one modification in relation to the normal sequence. These 
35 modifications may in particular affect amino acids responsible for a specificity or for the efficiency of 
the activity, or responsible for the structural conformation, for the charge or for the hydrophobicity, 
and for the capacity for multimerization and for membrane insertion of the polypeptide according to 
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the invention. It is thus possible to create polypeptides with an equivalent, an increased or a 
reduced activity, and with an equivalent, a narrower or a broader specificity. Among the modified 
polypeptides, there may be mentioned the polypeptides in which up to 5 amino acids may be modified, 
truncated at the N- or C-terminal end, or alternatively deleted, or else added. 
5 As is indicated, the modifications of the polypeptide may have in particular the objective: 

- of making it capable of modulating, regulating, inhibiting or inducing the expression of a gene of 
Chlamydia, in particular of Chlamydia pneumoniae and its variants, or one of its associated 
microorganisms, and/or capable of modulating the replication cycle of Chlamydia, in 
particular of Chlamydia pneumoniae and its variants, or one of its associated microorganisms, 

10 in the host cell and/or organism, 

of allowing its use in methods of biosynthesis or of biodegradation, or its incorporation into 

vaccine compositions, 
- of modifying its bioavailability as a compound for therapeutic use. 

The said modified polypeptides may also be used on any cell or microorganism for which 
15 the said modified polypeptides will be capable of modulating, regulating, inhibiting or inducing gene 
expression, or of modulating the growth or the replication cycle of the said cell or of the sa,d 
microorganism. The methods allowing demonstration of the said modulations on eukaryotic or 
prokaryotic cells are well known to persons skilled in the art. The said ce.ls or microorganisms will be 
chosen, in particular, from tumour cells or infectious microorganisms and the said modified 
20 polypeptides may be used for the prevention or treatment of pathologies linked to the presence of the 
said cells or of the said microorganisms. It is also clearly understood mat the nucleotide sequences 
encoding the said modified polypeptides may be used for the said modulations, for example by the 
intermediacy of vectors according to the invention and which are described below, so as to prevent or 

to treat the said pathologies. 
25 The above modified polypeptides may be obtained using combinatory chemistry, in 

which it is possible to systematically vary portions of the polypeptide before testing them on models, 
cell cultures or microorganisms for example, so as to select the compounds which are the most active 
or which exhibit the desired properties. 

Chemical synthesis also has the advantage of being able to use: 
3 o - nonnatural amino acids, or 

- nonpeptide bonds. 

Accordingly, in order to extend the life of the polypeptides according to the invention, it 
may be advantageous to use nonnatural amino acids, for example in the D form, or alternatively amino 
acid analogues, in particular sulphur-containing forms for example. 

Finally, the structure of the polypeptides according to the invention, its homologous or 
modified forms, as well as the corresponding fragments may be integrated into chemical structures of 
the polypeptide type and the like. Accordingly, it may be advantageous to provide at the N- and C- 
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terminal ends compounds which are not recognized by proteases. 

Also forming part of the invention are the nucleotide sequences encoding a polypeptide 
according to the invention. Described below are ORF nucleotide sequences encoding polypeptides 
exhibiting particularly preferable characteristics. For each group of preferred ORFS described below, 
5 it is to be understood that in addition to the individual ORFs listed, in instances wherein such ORFS 
are present as part of "combined" ORFs, the "combined" ORFs are also to be included within the 
preferred group. 

More particularly, the subject of the invention is nucleotide sequences, characterized in 
that they encode a polypeptide of the cellular envelope, preferably of the outer cellular envelope of 
10 Chlamydia pneumoniae or one of its representative fragments, such as for example the predominant 
proteins of the outer membrane, the adhesion proteins or the proteins entering into the composition of 
the Chlamydia wall. Among these sequences, the sequences comprising a nucleotide sequence chosen 
from the following sequences are most preferred: 

ORF15; ORF25; ORF26; ORF27; ORF28; ORF29; ORF30; ORF31; ORF32; ORF33; ORF35; 

15 ORF68; ORF124; ORF275; ORF291; ORF294; ORF327; ORF342; ORF364; ORF374; ORF380; 
ORF414; ORF439; ORF466; ORF467; ORF468; ORF469; ORF470; ORF472; ORF474; ORF476; 
ORF477; ORF478; ORF479; ORF480; ORF482; ORF485; ORF500; ORF501; ORF503; ORF504; 
ORF505; ORF506; ORF520; ORF578; ORF580; ORF581; ORF595; ORF596; ORF597; ORF737; 
ORF830; ORF834; ORF836; ORF893; ORF917; ORF932; ORF976; ORF1035; ORF1045; ORF1090 

2 0 and one of their representative fragments. 

The structure of the cytoplasmic membranes and of the wall of bacteria is dependent on 
the associated proteins. The structure of the cytoplasmic membrane makes it impermeable to water, to 
water-soluble substances and to small-sized molecules (ions, small inorganic molecules, peptides or 
proteins). To enter into or to interfere with a cell or a bacterium, a ligand must establish a special 

25 relationship with a protein anchored in the cytoplasmic membrane (the receptor). These proteins 
which are anchored on the membrane play an important role in metabolism since they control the 
exchanges in the bacterium. These exchanges apply to molecules of interest for the bacterium (small 
molecules such as sugars and small peptides) as well as undesirable molecules for the bacterium such 
as antibiotics or heavy metals. 

30 The double lipid layer structure of the membrane requires the proteins which are inserted 

therein to have hydrophobic domains of about twenty amino acids forming an alpha helix. 
Predominantly hydrophobic and potentially transmembrane regions may be predicted from the 
primary sequence of the proteins, itself deduced from the nucleotide sequence. The presence of one or 
more putative transmembrane domains raises the possibility for a protein to be associated with the 

35 cytoplasmic membrane and to be able to play an important metabolic role therein or alternatively for 
the protein thus exposed to be able to exhibit potentially protective epitopes. 

If the proteins inserted into the membrane exhibit several transmembrane domains 
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capable of interacting with one another via electrostatic bonds, it then becomes possible for 
these proteins to form pores which go across the membrane which becomes permeable for a number of 
substances. It should be noted that proteins which do not have transmembrane domains may also be 
anchored by the intermediacy of fatty acids in the cytoplasmic membrane, it being possmle for the 
5 breaking of the bond between the protein and its anchor in some cases to be responsible for the release 

of the peptide outside the bacterium. 

Preferably, the invention relates to the nucleotide sequences according to the invention, 
characterized in that they encode a Chlamydia pneumoniae transmembrane polypeptide or one of its 
representative fragments, having between 1 and 3 transmembrane domains and in that they compnse a 
10 nucleotide sequence chosen from the following sequences: 

ORF2- ORF3- ORF6; ORF9; ORF10; ORFll; ORF13; ORF14; ORF16; ORF18; ORF19; ORF20; 
ORF21- ORF22; ORF25; ORF27; ORF28; ORF29; ORF30; ORF31; ORF32; ORF33; ORF34; 
ORF35- ORF37; ORF39; ORF41; ORF42; ORF44; ORF45; ORF46; ORF47; ORF48; ORF49; 
ORF50- ORF53; ORF54; ORF56; ORF57; ORF59; ORF60; ORF61; ORF62; ORF63; ORF64; 
15 ORF65- ORF66; ORF69;; ORF72; ORF73; ORF74; ORF76; ORF77; ORF78; ORF79; ORF80; 
ORF82- ORF84; ORF85; ORF86; ORF88; ORF89; ORF90; ORF91; ORF92; ORF93; ORF95; 
ORF96- ORF98; ORF99; ORF100; ORF101; ORF102; ORF103; ORF104; ORF105; ORF106; 
ORF107; ORF108; ORF114; ORF117; ORF118; ORF122; ORF123; ORF124; ORF125; ORF129; 
ORF130- ORF131; ORF.32; ORF133; ORF134; ORF135; ORF137; ORF138; ORF139; ORF140; 
20 ORF141; ORF142; ORF143; ORF145; ORF146; ORF147; ORF150; ORF151; ORF152; ORF156; 
ORF157- ORF158; ORF159; ORF160; ORF161; ORF162; ORF164; ORF166; ORF167; ORF170; 
ORF173- ORF175; ORF176; ORF178; ORF179; ORF180; ORF182; ORF183; ORF184; ORF185; 
ORF186'- ORF187; ORF188; ORF189; ORF190; ORF191; ORF192; ORF194; ORF195; ORF196; 
ORF197- ORF198; ORF199; ORF200; ORF201; ORF202; ORF205; ORF207; ORF208; ORF209; 
25 ORF210- ORF212; ORF215; ORF219; ORF220; ORF224; ORF226; ORF227; ORF228; ORF231; 
ORF232- ORF233; ORF234; ORF235; ORF236; ORF238; ORF239; ORF240; ORF241; ORF242; 
ORF244- ORF247; ORF251; ORF252; ORF253; ORF255; ORF256; ORF257; ORF258; ORF260; 
ORF262- ORF263; ORF266; ORF267; ORF268; ORF269; ORF270; ORF273; ORF274; ORF276; 
ORF278- ORF279; ORF280; ORF281; ORF282; ORF283; ORF284; ORF286; ORF287; ORF289; 
30 ORF290- ORF291; ORF293; ORF294; ORF297; ORF304; ORF305; ORF307; ORF308; ORF309; 
ORF310- ORF311; ORF313; ORF314; ORF315; ORF316; ORF318; ORF319; ORF320; ORF321; 
ORF322- ORF323; ORF324; ORF325; ORF326; ORF331; ORF332; ORF336; ORF338; ORF339; 
ORF341- ORF344; ORF345; ORF346; ORF350; ORF352; ORF353; ORF356; ORF357; ORF358; 
ORF359- ORF360; ORF362; ORF365; ORF366; ORF367; ORF370; ORF372; ORF373; ORF376; 
35 ORF377- ORF378; ORF379; ORF381; ORF382; ORF383; ORF384; ORF385; ORF386; ORF387; 
ORF390- ORF392; ORF393; ORF394; ORF396; ORF398; ORF399; ORF400; ORF404; ORF408; 
ORF410- ORF411; ORF413; ORF416; ORF417; ORF418; ORF420; ORF422; ORF424; ORF427; 



WO 99/27105 PCT/IB98/01890 

21 

ORF428; ORF429; ORF430; ORF431; ORF433; ORF434; ORF437; ORF440; ORF441; ORF442; 

ORF443; ORF444; ORF445; ORF447; ORF450; ORF451; ORF452; ORF455; ORF456; ORF459; 

ORF460; ORF461; ORF462; ORF463; ORF464; ORF465; ORF467; ORF469; ORF471; ORF474; 

ORF475; ORF476; ORF477; ORF479; ORF482; ORF483; ORF484; ORF485; ORF486; ORF487; 
5 ORF488; ORF491; ORF493; ORF494; ORF497; ORF498; ORF499; ORF503; ORF508; ORF509; 

ORF510; ORF512; ORF514; ORF515; ORF516; ORF517; ORF518; ORF520; ORF521; ORF523; 

ORF525; ORF527; ORF528; ORF529; ORF530; ORF531; ORF533; ORF534; ORF535; ORF536; 

ORF537; ORF540; ORF541; ORF543; ORF544; ORF545; ORF546; ORF548; ORF549; ORF551; 

ORF553; ORF554; ORF555; ORF556; ORF557; ORF558; ORF559; ORF560; ORF562; ORF563; 
10 ORF564; ORF565; ORF566; ORF569; ORF571; ORF573; ORF576; ORF577; ORF581; ORF583; 

ORF584; ORF585; ORF586; ORF588; ORF591; ORF592; ORF594; ORF595; ORF596; ORF597; 

ORF599; ORF600; ORF603; ORF605; ORF608; ORF614; ORF615; ORF620; ORF621; ORF622; 

ORF623; ORF624; ORF625; ORF629; ORF630; ORF631; ORF633; ORF634; ORF637; ORF642; 

ORF644; ORF645; ORF647; ORF648; ORF652; ORF654; ORF655; ORF657; ORF658; ORF659; 
15 ORF660; ORF661; ORF664; ORF665; ORF666; ORF667; ORF670; ORF671; ORF672; ORF673; 

ORF674; ORF676; ORF679; ORF681; ORF684; ORF687; ORF688; ORF689; ORF690; ORF693; 

ORF694; ORF695; ORF696; ORF697; ORF698; ORF699; ORF700; ORF701; ORF703; ORF705; 

ORF706; ORF707; ORF708; ORF710; ORF712; ORF715; ORF716; ORF717; ORF718; ORF719; 

ORF721; ORF722; ORF723; ORF725; ORF726; ORF727; ORF728; ORF729; ORF730; ORF731; 
20 ORF733; ORF736; ORF737; ORF738; ORF740; ORF741; ORF742; ORF743; ORF747; ORF748; 

ORF750; ORF752; ORF754; ORF755; ORF756; ORF757; ORF759; ORF760; ORF761; ORF762; 

ORF763; ORF764; ORF765; ORF766; ORF767; ORF768; ORF772; ORF774; ORF775; ORF777; 

ORF781; ORF783; ORF788; ORF791; ORF792; ORF793; ORF794; ORF795; ORF796; ORF797; 

ORF798; ORF799; ORF802; ORF803; ORF806; ORF807; ORF808; ORF809; ORF810; ORF811; 
25 ORF813; ORF814; ORF815; ORF816; ORF817; ORF819; ORF820; ORF821; ORF823; ORF824; 

ORF827; ORF829; ORF830; ORF831; ORF833; ORF834; ORF835; ORF837; ORF844; ORF845; 

ORF846; ORF847; ORF848; ORF849; ORF850; ORF851; ORF852; ORF854; ORF855; ORF856; 

ORF857; ORF859; ORF860; ORF862; ORF865; ORF866; ORF868; ORF869; ORF870; ORF871; 

ORF872; ORF874; ORF877; ORF878; ORF879; ORF880; ORF881; ORF882; ORF884; ORF885; 
30 ORF888; ORF889; ORF890; ORF891; ORF892; ORF894; ORF895; ORF896; ORF897; ORF899; 

ORF900; ORF902; ORF903; ORF904; ORF905; ORF909; ORF910; ORF912; ORF913; ORF914; 

ORF915; ORF917; ORF918; ORF919; ORF921; ORF923; ORF924; ORF926; ORF927; ORF928; 

ORF929; ORF930; ORF931; ORF937; ORF938; ORF939; ORF941; ORF943; ORF948; ORF951; 

ORF952; ORF953; ORF958; ORF960; ORF963; ORF964; ORF965; ORF968; ORF970; ORF974; 
35 ORF975; ORF977; ORF979; GRF980; ORF981; ORF983; ORF984; ORF985; ORF987; ORF989: 

ORF992; ORF993; ORF997; ORF998; ORF999; ORF1001; ORF1002; ORF1004; ORF1005 

ORF1009; ORF1013; ORF1014; ORF1015; ORF1016; ORF1019; ORF1021; ORF1023; ORF1024 
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ORF1029; ORF1031; ORF1033; ORF1034; ORF1039; ORF1041; ORF1042; ORF1045 
ORF1047; ORF1049; ORF1051; ORF1052; ORF1053; ORF1054; ORF1056; ORF1059; ORF1061 
ORF1062; ORF1063; ORF1064; ORF1065; ORF1067; ORF1075; ORF1077; ORF1078; ORF1079 
ORF1080; ORF1081; ORF1089; ORF1095; ORF1097; ORF1098; ORF1099; ORF1101; ORF1102 
5 ORF1103; ORF1106; ORF1107; ORFU08; ORF1109; ORF1110; ORF1113; ORF1116; ORF1118 
ORF1119; ORF1121; ORF1123; ORFU24; ORF1126; ORF1128; ORF1130; ORF1131; ORF1133 
ORF1 134; ORF1 1 36; ORF1 137 and one of their representative fragments. 

Preferably, the invention relates to the nucleotide sequences according to the invention, 
characterized in that they encode a Chlamydia pneumoniae transmembrane polypeptide or one of its 
10 representative fragments, having between 4 and 6 transmembrane domains and in that they comprise a 
nucleotide sequence chosen from the following sequences: 

ORF5; ORF7; ORF8; ORF15; ORF36; ORF38; ORF51; ORF55; ORF58; ORF67; ORF70; ORF81 
ORF97; ORFHO; ORF111; ORF115; ORF119; ORF126; ORF128; ORF148; ORF155; ORF163 
ORF165; ORF168; ORF169; ORF171; ORF172; ORF174; ORF177; ORF181; ORF193; ORF203 
15 ORF213; ORF214; ORF216; ORF217; ORF221; ORF222; ORF225; ORF229; ORF243; ORF246 
ORF248; ORF254; ORF261; ORF285; ORF288; ORF292; ORF296; ORF298; ORF299; ORF301 
ORF303; ORF317; ORF328; ORF329; ORF351; ORF354; ORF355; ORF364; ORF371; ORF374: 
ORF375; ORF391; ORF395; ORF401; ORF403; ORF405; ORF409; ORF414; ORF419; ORF421 
ORF423; ORF425; ORF438; ORF448; ORF453; ORF458; ORF466; ORF468; ORF470; ORF480 
20 ORF489-, ORF490; ORF496; ORF501; ORF504; ORF505; ORF506; ORF511; ORF513; ORF519 
ORF526; ORF532; ORF538; ORF539; ORF547; ORF550; ORF561; ORF568; ORF570; ORF574: 
ORF578- ORF579; ORF580; ORF582; ORF589; ORF593; ORF598; ORF601; ORF604; ORF610: 
ORF613; ORF617; ORF626; ORF632; ORF635; ORF638; ORF640; ORF641; ORF646; ORF649 
ORF650-, ORF651; ORF686; ORF711; ORF724; ORF732; ORF734; ORF744; ORF745; ORF749 
25 ORF751; ORF769; ORF770; ORF771; ORF773; ORF776; ORF779; ORF780; ORF785; ORF787 
ORF789; ORF801; ORF805; ORF812; ORF822; ORF825; ORF826; ORF839; ORF841; ORF843 
ORF853-, ORF861; ORF875; ORF876; ORF886; ORF893; ORF898; ORF906; ORF907; ORF908 
ORF920; ORF922; ORF925; ORF933; ORF935; ORF936; ORF944; ORF946; ORF947; ORF954 
ORF959; ORF961; ORF966; ORF967; ORF972; ORF978; ORF995; ORF996; ORF1000; ORF1003 
30 ORF1010; ORF1011; ORF1012; ORF1017; ORF1020; ORF1030; ORF1036; ORF1038; ORF1043 
ORF1046; ORF1048; ORF1050; ORF1058; ORF1071; ORF1073; ORF1084; ORF1085; ORF1086 
ORF1087; ORF1091; ORF1092; ORF1094; ORF1096; ORF1100; ORF1104; ORF1111; ORF1112 
ORF1 1 14; ORF1 1 17; ORF1 122; ORF1 125 and one of their representative fragments. 

Preferably, the invention also relates to the nucleotide sequences according to the 
35 invention, characterized in that they encode a Chlamydia pneumoniae transmembrane polypeptide or 
one of its representative fragments, having at least 7 transmembrane domains and in that they 
comprise a nucleotide sequence chosen from the following sequences: 
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ORF17; ORF52; ORF68; ORF83; ORF87; ORF109; ORF112; ORF113; ORF120; ORF121; 
ORF127; ORF153; ORF204; ORF211; ORF218; ORF223; ORF275; ORF277; ORF295; ORF300; 
ORF302; ORF306; ORF327; ORF335; ORF342; ORF343; ORF347; ORF349; ORF361; ORF363; 
ORF369; ORF380; ORF388; ORF389; ORF397; ORF415; ORF432; ORF439; ORF446; ORF449; 
5 ORF472; ORF478; ORF500; ORF522; ORF524; ORF567; ORF575; ORF602; ORF606; ORF609; 
ORF636; ORF639; ORF643; ORF653; ORF668; ORF692; ORF702; ORF704; ORF713; ORF720; 
ORF778; ORF784; ORF800; ORF836; ORF838; ORF842; ORF864; ORF867; ORF883; ORF901; 
ORF916; ORF932; ORF934; ORF940; ORF942; ORF950; ORF956; ORF971; ORF973; ORF976; 
ORF988; ORF994; ORF1018; ORF1028; ORF1035; ORF1037; ORF1044; ORF1055; ORF1057; 

10 ORF1068; ORF1069; ORF1070; ORF1072; ORF1082; ORF1088; ORF1105; ORF1132; ORF1135 
and one of their representative fragments. 

Preferably, the invention relates to the nucleotide sequences according to the invention, 
characterized in that they encode a Chlamydia pneumoniae surface exposed polypeptide (e.g., an outer 
membrane protein) or one of its representative fragments, said nucleotide sequences comprising a 

1 5 nucleotide sequence chosen from the following sequences: 

ORF 15, ORF 25, ORF 26, ORF 27, ORF 28, ORF 29, ORF 30, ORF 3 1, ORF 32, ORF 33, ORF 35, 
ORF 36, ORF 1257, ORF 280, ORF 291, ORF 314, ORF 354, ORF 380, ORF 1266, ORF 466, ORF 
467, ORF 468, ORF 469, ORF 470, ORF 472, ORF 474, ORF 476, ORF 477, ORF 478, ORF 479, 
ORF 480, ORF 482, ORF 483, ORF 485, ORF 486, ORF 500, ORF 501, ORF 503, ORF 504, ORF 

2 0 505, ORF 506, ORF 507, ORF 1268, ORF 1269, ORF 543, ORF 544, ORF 578, ORF 579, ORF 580, 
ORF 581, ORF 595, ORF 596, ORF 597, ORF 1271, ORF 633, ORF 637, ORF 699, ORF 706, ORF 
737, ORF 744, ORF 1273, ORF 751, ORF 775, ORF 776, ORF 777, ORF 793, ORF 815, ORF 830, 
ORF 1221, ORF 849, ORF 851, ORF 852, ORF 874, ORF 891, ORF 922, ORF 940, ORF 1231, ORF 
1281, ORF 1035, ORF 1079, ORF 1087, ORF 1108, and one of their representative fragments. 

25 Preferably, the invention relates to the nucleotide sequences according to the invention, 

characterized in that they encode a Chlamydia pneumoniae lipoprotein or one of its representative 
fragments, said nucleotide sequences comprising a nucleotide sequence chosen from the following 
sequences: 

ORF 3, ORF 10, ORF 11, ORF 16, ORF 1254, ORF 1255, ORF 38, ORF 1256, ORF 62, ORF 85, 
30 ORF 1258, ORF 115, ORF 1151, ORF 151, ORF 1259, ORF 173, ORF 1261, ORF 186, ORF 194, 
ORF 205, ORF 214, ORF 216, ORF 217, ORF 238, ORF 1177, ORF 280, ORF 291, ORF 317, ORF 
327, ORF 354, ORF 364, ORF 367, ORF 414, ORF 432, ORF 1 192, ORF 460, ORF 1267, ORF 1268, 
ORF 520, ORF 536, ORF 1270, ORF 576, ORF 597, ORF 603, ORF 609, ORF 637, ORF 1272, ORF 
652, ORF 1213, ORF 699, ORF 705, ORF 706, ORF 708, ORF 71 1, ORF 727, ORF 1274, ORF 800, 
35 ORF 814, ORF 825, ORF 829, ORF 830, ORF 831, ORF 844, ORF 849, ORF 1275, ORF 1276, ORF 
1277, ORF 872, ORF 878, ORF 880, ORF 891, ORF 892, ORF 1278, ORF 1279, ORF 1280, ORF 
941, ORF 942, ORF 1282, ORF 1283, ORF 952, ORF 988, ORF 998, ORF 1009, ORF 1285, ORF 
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1235, ORF 1028, ORF 1056, ORF 1070, ORF 1287, ORF 1087, ORF 1288, ORF 1289, ORF 
1098, ORF 1246, ORF 1291, ORF 1108, ORF 1109, ORF 1112, ORF 1133, and one of their 

representative fragments. 

Preferably, the invention relates to the nucleotide sequences according to the invention, 
5 characterized in that they encode a Chlamydia pneumoniae polypeptide involved in lipopolysaccharide 
(LPS) biosynthesis, said nucleotide sequences comprising a nucleotide sequence chosen from the 
following sequences: ORF 316, ORF 564, ORF 610, ORF 647, ORF 121 1, ORF 688, ORF 924, and 

one of their representative fragments. 

Preferably the invention relates to additional LPS-related nucleotide sequences according 

10 to the invention, characterized in that they encode: 

(a) a Chlamydia pneumoniae KDO (3-deoxy-D-manno-octulosonic acid)-related 
polypeptide or one of its representative fragments, said nucleotide sequences comprising a nucleotide 
sequence chosen from the following sequences: ORF 177, ORF 1 156, ORF 245, ORF 767, and one of 

their representative fragments; 
15 (b) a Chlamydia pneumoniae phosphomannomutase-related polypeptide or one of its 

representative fragments, said nucleotide sequences comprising a nucleotide sequence chosen from the 
following sequences: ORF 74, and one of its representative fragments; 

(c) a Chlamydia pneumoniae phosphoglucomutase-related polypeptide or one of its 
representative fragments, said nucleotide sequences comprising a nucleotide sequence chosen from the 

2 0 following sequences: ORF 1286, ORF 1039, and one of their representative fragments; and 

(d) a Chlamydia pneumoniae lipid A component-related polypeptide or one of its 
representative fragments, said nucleotide sequences comprising a nucleotide sequence chosen from the 
following sequences: ORF 689, ORF 690, ORF 691, ORF 1037, and one of their representative 
fragments. 

25 Preferably, the invention relates to the nucleotide sequences according to the invention, 

^ characterized in that they encode a Chlamydia pneumoniae polypeptide containing ROD (Arg-Gly- 
Asp) attachment sites or one of its representative fragments. 

( a ) RGD^ontaining proteins that are outer membrane proteins, are more likely to play a 
role in cell attachment. ORFs that encoded a protein containing an RGD sequence 
and also were classified as outer membrane proteins are ORF 468 and its 
representative fragments. 

(b) An RGD-encoding ORF that showed homology to cdsl, cds2, and copN type ffl 
35 virulence loci in Chlamydia psittaci (Hsia, R. et al. (1997), Type Dl secretion genes 

identity a putative virulence locus of Chlamydia. Molecular Microbiology 25:351- 
359) is ORF 350, and its representative fragments. 
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(c)The outer membrane of Chlamydia is made of cysteine-rich proteins that form a 
network of both intra and inter molecular disulfide links. This contributes to the 
integrity of the membrane since Chlamydia lacks the peptidoglycan layer that other 
gram-negative bacteria have. Cysteine-rich proteins that have the RGD sequence are 
also considered to be potential vaccine candidates. Cysteine-rich proteins were 
defined as proteins that had more than 3.0% cysteine in their primary amino acid 
sequence, above the mean genomic ORF cysteine content. The corresponding ORFs 
are: ORF 1290, ORF 1294, ORF 1296, and one of their representative fragments. 



(d)The outer membrane of Chlamydia may also contain small proteins that have cysteines 
in their N- and C-terminus that may contribute to the network formed by disulfide 
linkages. These proteins may be anchored in the outer membrane via their N- 
terminus and may have their C-terminus exposed, which then can interact with the 

15 host cells. Alternatively, these proteins may be anchored in the outer membrane via 

both N-and C-terminus and may have regions in the middle that may be exposed 
which can in turn interact with the host cells. ORFs encoding polypeptides that 
contain cysteines in their first 30 amino acids and also contain an RGD sequence are: 
ORF 105, ORF 106, ORF 114, ORF 170, ORF 171, ORF 1264, ORF 268, ORF 

20 1265, ORF 350, ORF 393, ORF 394, ORF 451 , ORF 452, ORF 453, ORF 473, ORF 

499, ORF 515, ORF 519, ORF 525, ORF 526, ORF 538, ORF 611, ORF 645, ORF 
686, ORF 700, ORF 746, ORF 755, ORF 756, ORF 757, ORF 789, ORF 814, ORF 
855, ORF 856, ORF 878, ORF 957, ORF 958, ORF 989, ORF 1290, and one of their 
representative fragments. 



(e) RGD-containing ORFs homologous to RGD-containing ORFs from Chlamydia 
trachomatis are: 

ORF 114, ORF 468, ORF 755, ORF 756, ORF 757, ORF 855, ORF 856, ORF 905, 
ORF 913, ORF 914, ORF 915, and one of their representative fragments. 



Preferably, the invention relates to the nucleotide sequences according to the invention, 
characterized in that they encode a Chlamydia pneumoniae Type III or other, non-type III secreted 
polypeptide or one of its representative fragments, said nucleotide sequences comprising a nucleotide 
sequence chosen from the following sequences: 
35 ORF 25, ORF 28, ORF 29, ORF 33, ORF 308, ORF 309, ORF 343, ORF 344, ORF 345, ORF 367, 
ORF 414, ORF 415, ORF 480, ORF 550, ORF 579, ORF 580, ORF 581, ORF 597, ORF 699, ORF 
744, ORF 751, ORF 776, ORF 866, ORF 874, ORF 883, ORF 884, ORF 888, ORF 891, ORF 1293, 
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and one of their representative fragments. 

Preferably, the invention relates to the nucleotide sequences according to the invention, 
characterized in that they encode a Chlamydia pneumoniae cell wall anchored surface polypeptide or 
one of its representative fragments, said nucleotide sequences comprising a nucleotide sequence 
5 chosen from the following sequences: ORF 267, ORF 271, ORF 419, ORF 590, ORF 932, ORF 1292, 
ORF 1295, and one of their representative fragments. 

Preferably, the invention relates to the nucleotide sequences according to the invention, 
characterized in that they encode Chlamydia pneumoniae polypeptides not found in Chlamydia 
trachomatis (Blastp. P>e ,(> ), said nucleotide sequences comprising a nucleotide sequence chosen from 
10 the following sequences: ORF 7, ORF 8, ORF 9, ORF 16, ORF 17, ORF 18, ORF 19, ORF 20, ORF 
21, ORF 22, ORF 1254, ORF 23, ORF 1255, ORF 24, ORF 1 139, ORF 1 140, ORF 46, ORF 47, ORF 
51, ORF 60, ORF 1256, ORF 61, ORF 62, ORF 63, ORF 64, ORF 1257, ORF 65, ORF 66, ORF 67, 
ORF 68, ORF 1 143, ORF 1 145, ORF 83, ORF 84, ORF 1 146, ORF 85, ORF 86, ORF 87, ORF 1258, 
ORF 1 16, ORF 1 17, ORF 125, ORF 1 148, ORF 143, ORF 1 150, ORF 1 151, ORF 144, ORF 145, ORF 
15 147, ORF 148, ORF 149, ORF 150, ORF 152, ORF 1259, ORF 162, ORF 166, ORF 1154, ORF 167, 
ORF 1261, ORF 1156, ORF 1157, ORF 178, ORF 179, ORF 1158, ORF 182, ORF 183, ORF 184, 
ORF 185, ORF 1 159, ORF 186, ORF 1 160, ORF 187, ORF 188, ORF 189, ORF 190, ORF 1 161 , ORF 
1162, ORF 191, ORF 192, ORF 194, ORF 195, ORF 1163, ORF 196, ORF 201, ORF 202, ORF 209, 
ORF 212, ORF 221, ORF 224, ORF 1 167, ORF 226, ORF 227, ORF 228, ORF 229, ORF 230, ORF 
20 231, ORF 232, ORF 1169, ORF 1170, ORF 1171, ORF 234, ORF 235, ORF 236, ORF 1172, ORF 
243, ORF 25 1, ORF 252, ORF 1 176, ORF 253, ORF 255, ORF 254, ORF 256, ORF 1 177, ORF 1 178, 
ORF 262, ORF 263, ORF 1264, ORF 278, ORF 279, ORF 1 180, ORF 280, ORF 290, ORF 291, ORF 
292, ORF 296, ORF 1181, ORF 297, ORF 298, ORF 300, ORF 1265, ORF 322, ORF 324, ORF 325, 
ORF 370, ORF 1 186, ORF 371 , ORF 372, ORF 1 187, ORF 373, ORF 378, ORF 1266, ORF 382, ORF 
25 383, ORF 384, ORF 385, ORF 386, ORF 1188, ORF 1 189, ORF 391, ORF 392, ORF 398, ORF 400, 
ORF 403, ORF 1 191, ORF 423, ORF 435, ORF 445, ORF 450, ORF 1 193, ORF 456, ORF 460, ORF 
461, ORF 465, ORF 1196, ORF 471, ORF 473, ORF 475, ORF 481, ORF 484, ORF 487, ORF 488, 
ORF 489, ORF 490, ORF 491, ORF 492, ORF 493, ORF 494, ORF 495, ORF 496, ORF 497, ORF 
498, ORF 499, ORF 502, ORF 1267, ORF 1268, ORF 508, ORF 510, ORF 509, ORF 512, ORF 515, 
30 ORF 519, ORF 1 197, ORF 521, ORF 1 198, ORF 522, ORF 524, ORF 528, ORF 534, ORF 537, ORF 
1269, ORF 1270, ORF 548, ORF 551, ORF 557, ORF 1201, ORF 1203, ORF 562, ORF 566, ORF 
593, ORF 595, ORF 600, ORF 1271, ORF 604, ORF 611, ORF 612, ORF 614, ORF 616, ORF 625, 
ORF 627, ORF 628, ORF 629, ORF 631, ORF 641, ORF 1272, ORF 648, ORF 1212, ORF 663, ORF 
685, ORF 707, ORF 714, ORF 715, ORF 716, ORF 717, ORF 722, ORF 746, ORF 1273, ORF 761, 
35 ORF 764, ORF 770, ORF 1217, ORF 783, ORF 1274, ORF 803, ORF 815, ORF 1220, ORF 835, ORF 
1221, ORF 844, ORF 845, ORF 846, ORF 847, ORF 848, ORF 849, ORF 850, ORF 851, ORF 1275, 
ORF 852, ORF 862, ORF 1276, ORF 1277, ORF 873, ORF 1223, ORF 892, ORF 919, ORF 1225, 
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ORF 1278, ORF 926, ORF 1228, ORF 1229, ORF 1230, ORF 1279, ORF 1281, ORF 1282, 
ORF 1283, ORF 948, ORF 950, ORF 949, ORF 951, ORF 980, ORF 982, ORF 1233, ORF 999, ORF 
1000, ORF 1001, ORF 1002, ORF 1008, ORF 1285, ORF 1235, ORF 1016, ORF 1019, ORF 1027, 
ORF 1036, ORF 1241, ORF 1048, ORF 1049, ORF 1050, ORF 1053, ORF 1054, ORF 1064, ORF 
5 1076, ORF 1091, ORF 1288, ORF 1093, ORF 1289, ORF 1101, ORF 1103, ORF 1245, ORF 1246, 
ORF 1247, ORF 1290, ORF 1291, ORF 1115, ORF 1116, ORF 1118, ORF 1120, ORF 1249, ORF 
1121, ORF 1250, ORF 1126, ORF 1251, ORF 1127, ORF 1128, ORF 1130, ORF 1129, ORF 1131, 
ORF 1136, ORF 1253, ORF 1292, ORF 1294, ORF 1295, ORF 1296, and one of their representative 
fragments. 

10 Preferably, the invention also relates to the nucleotide sequences according to the 

invention, characterized in that they encode a Chlamydia pneumoniae polypeptide or one of its 
representative fragments which is involved in the intermediate metabolism, in particular in the 
metabolism of sugars and/or of cofactors, such as for example triose phosphate isomerase or pyruvate 
kinase, and in that they comprise a nucleotide sequence chosen from the following sequences: 

15 ORF2; ORF55; ORF56; ORF69; ORF75; ORF80; ORF100; ORF110; ORF114; ORF120; ORF121; 
ORF157; ORF160; ORF161; ORF172; ORF180; ORF181; ORF198; ORF200; ORF225; ORF248; 
ORF249; ORF276; ORF277; ORF318; ORF319; ORF320; ORF323; ORF331; ORF347; ORF375; 
ORF376; ORF381; ORF393; ORF394; ORF395; ORF396; ORF409; ORF446; ORF447; ORF448; 
ORF449; ORF513; ORF516; ORF571; ORF647; ORF662; ORF697; ORF718; ORF793; ORF794; 

20 ORF808; ORF809; ORF838; ORF839; ORF840; ORF853; ORF854; ORF918; ORF923; ORF929; 
ORF931; ORF938; ORF939; ORF958; ORF959; ORF960; ORF966; ORF995; ORF1021; ORF1040; 
ORF1041; ORF1042; ORF1085; ORF1100; ORF1102; ORF1117; ORF1118; ORF1119; ORF1120; 
ORF1 135 and one of their representative fragments. 

Preferably, the invention also relates to the nucleotide sequences according to the 

25 invention, characterized in that they encode a Chlamydia pneumoniae polypeptide or one of its 
representative fragments which is involved in the intermediate metabolism of nucleotides or nucleic 
acids, such as for example CTP synthetase or GMP synthetase, and in that they comprise a nucleotide 
sequence chosen from the following sequences: 

ORF77; ORF78; ORF138; ORF189; ORF190; ORF233; ORF246; ORF338; ORF412; ORF421; 
30 ORF438; ORF607; ORF648; ORF657; ORF740; ORF783; ORF967; ORF989; ORF990; ORF992; 

ORF101 1 ; ORF1058; ORF1059; ORF1073; ORF1074 and one of their representative fragments. 

Preferably, the invention also relates to the nucleotide sequences according to the 

invention, characterized in that they encode a Chlamydia pneumoniae polypeptide or one of its 

representative fragments which is involved in the metabolism of nucleic acids, such as for example 
35 DNA polymerases or DNA topoisomerases, and in that they comprise a nucleotide sequence chosen 

from the following sequences: 

ORF14; ORF59; ORF70; ORF71; ORF97; ORF1 13; ORF137; ORF141; ORF169; ORF285; ORF287; 
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ORF288; ORF313; ORF326; ORF358; ORF411; ORF443; ORF548; ORF569; ORF601; ORF651; 
ORF654 ORF658; ORF659; ORF664; ORF665; ORF694; ORF698; ORF704; ORF760; ORF762; 
ORF763'- ORF786; ORF787; ORF788; ORF801; ORF802; ORF812; ORF819; ORF822; ORF870; 
ORF897- ORF898; ORF902; ORF908; ORF9.6; ORF954; ORF955; ORF961; ORF983; ORF996; 
5 ORF1007; ORF1012; ORF1013; ORF1014; ORF1015; ORF1038; ORF1137 and one of the* 

representative fragments. 

Preferably, the invention also relates to the nucleotide sequences accordmg to the 
invention characterized in that they encode a Chlamydia pneumoniae polypeptide or one of tt 
representative fragments which is involved in the metabolism of amino acids or polypeptides, such as 
10 for example serine hydroxymethyl transferase or the proteins which load amino acids onto transfer 
RNAs, and in that they comprise a nucleotide sequence chosen from the following sequences: 
ORF99- ORFlll; ORF127; ORF134; ORF140; ORF174; ORF175; ORF176; ORF353; ORF377; 
ORF404- ORF523; ORF539; ORF559; ORF561; ORF586; ORF598; ORF609; ORF636; ORF687; 
ORFTW ORF701; ORF759; ORF790; ORF857; ORF861; ORF904; ORF936; ORF952; ORF962; 
15 ORF963- ORF964; ORF965; ORF991; ORF1003; ORF1004; ORF1005; ORF1018; ORF1067; 
OR^HO; ORFllll; ORF1H2; ORF1H4; ORF1121; ORF1122; ORF1123; ORF1124; ORF1125 
and one of their representative fragments. 

Preferably, the invention also relates to the nucleotide sequences accordmg to the 
invention, characterized in that they encode a Chlamydia pneumoniae polypeptide or one of »ts 
20 representative fragments which is involved in the metabolism of peptides, such as for examp e 
protein kinases or proteases, and in that they comprise a nucleotide sequence chosen from the 

following sequences: AD mcn. 
ORF4- ORF44; ORF45; ORF48; ORF54; ORFlll ORF130; ORF155; ORF163; ORF212; ORF257, 
ORF30, ; ORF343; OIt F40 5 ; ORP4.6; ORF45S; ORF540; ORF541-, ORF542; ORF543; ORF544; 
25 ORF560- ORF594-, ORF652; ORF699; ORF723; ORF747; ORF8.7; ORP827; ORFS7.; ORF909 ; 
ORF910; ORF911; ORF912; ORF.023; ORF1051; ORF!052; ORF1081 and one of .her 

representative fragments. 

Preferably, the invention also relates to the nucleotide sequences accordmg to the 
invention, characterized in that they encode a Chlamydia pneumoniae polypeptide or one of .|s 
30 representative fragments which is invo.ved in the metabohsm of fatty acids, such as for example 
succinyl-CoA-synthesizing proteins or phosphatidylserine synthetase, and in that they compnse a 
nucleotide sequence chosen from the following sequences: 

ORF76- ORF284, ORF308; ORF309; ORF310; ORF311; ORF312; ORF425; ORF433; ORF565; 
ORF688- ORF690; ORF691; ORF767; ORF797; ORF894; ORF895; ORF994; ORF1020; ORF1030; 
35 ORF1033- ORF1034; ORF1046; ORF1047; ORF1057 and one of their representatwe fragments. 

Preferably, the invention also relates to the nucleotide sequences according to the 
invention, characterized in that they encode a Chlamydia pneumoniae polypeptide or one of rts 
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representative fragments which is involved in the synthesis of the wall, such as for example 
KDO transferase, and the proteins responsible for the attachment of certain sugars onto the exposed 
proteins, and in that they comprise a nucleotide sequence chosen from the following sequences: 
ORF49; ORF50; ORF177; ORF178; ORF245; ORF610; ORF972; ORF974; ORF978; ORF1037 and 
5 one of their representative fragments. 

Preferably, the invention also relates to the nucleotide sequences according to the 
invention, characterized in that they encode a Chlamydia pneumoniae polypeptide or one of its 
representative fragments which is involved in the transcription, translation and/or maturation process, 
such as for example initiation factors, RNA polymerases or certain chaperone proteins, and in that 

10 they comprise a nucleotide sequence chosen from the following sequences: 

ORF90; ORF92; ORF131; ORF151; ORF199; ORF333; ORF334; ORF336; ORF379; ORF589; 
ORF590; ORF619; ORF630; ORF649; ORF739; ORF741; ORF806; ORF821; ORF843; ORF968; 
ORF971; ORF1061 and one of their representative fragments. 

Preferably, the invention also relates to the nucleotide sequences according to the 

15 invention, characterized in that they encode a Chlamydia pneumoniae ribosomal polypeptide or one of 
its representative fragments, such as for example the ribosomal proteins L21, L27 and S10, and in that 
they comprise a nucleotide sequence chosen from the following sequences: 

ORF93; ORF94; ORF95; ORF136; ORF259; ORF332; ORF348; ORF583; ORF584; ORF588; 
ORF591; ORF592; ORF663; ORF666; ORF667; ORF669; ORF670; ORF671; ORF672; ORF673; 
2 0 ORF674; ORF675; ORF676; ORF677; ORF678; ORF679; ORF680; ORF681; ORF683; ORF684; 
ORF738; ORF781; ORF1008; ORF1024; ORF1025; ORF1066 and one of their representative 
fragments. 

Preferably, the invention also relates to the nucleotide sequences according to the 
invention, characterized in that they encode a Chlamydia pneumoniae transport polypeptide or one of 
25 its representative fragments, such as for example the proteins for transporting amino acids, sugars and 
certain oligopeptides, and in that they comprise a nucleotide sequence chosen from the following 
sequences: 

ORF40; ORF41; ORF52; ORF105; ORF106; ORF107; ORF109; ORF133; ORF210; ORF211; 

ORF214; ORF215; ORF216; ORF217; ORF218; ORF219; ORF220; ORF223; ORF242; ORF260; 
30 ORF293; ORF299; ORF366; ORF369; ORF575; ORF602; ORF638; ORF639; ORF640; ORF643; 

ORF653; ORF702; ORF703; ORF724; ORF732; ORF855; ORF856; ORF901; ORF906; ORF933; 

ORF942; ORF1043; ORF1086; ORF1 105 and one of their representative fragments. 

Preferably, the invention also relates to the nucleotide sequences according to the 

invention, characterized in that they encode a Chlamydia pneumoniae polypeptide or one of its 
35 representative fragments which is involved in the virulence process, such as for example the proteins 

analogous to the Escherichia coll vacB protein, and in that they comprise a nucleotide sequence 

chosen from the following sequences: 
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ORF546; ORF550; ORF778; ORF779; ORF886 and one of their representative fragments. 

Preferably, the invention also relates to the nucleotide sequences according to the 
invention, characterized in that they encode a Chlamydia pneumoniae polypeptide or one of its 
representative fragments which is involved in the secretory system and/or which is secreted, such as 
5 for example proteins homologous to proteins in the secretory system of certain bacteria such as the 
Salmonellae or the Yersiniae, and in that they comprise a nucleotide sequence chosen from the 
following sequences: 

ORF751; ORF874; ORF875; ORF876; ORF883; ORF884; ORF885 and one of their representative 
fragments. 

10 Preferably, the invention also relates to a nucleotide sequence according to the invention, 

characterized in that they encode a polypeptide specific to Chlamydia pneumoniae or one of its 
representative fragments (with a Blast E value of >10" 5 ), and in that they comprise a nucleotide 
sequence chosen from the following sequences: 

ORF7- ORF8- ORF17; ORF18; ORF19; ORF20; ORF22; ORF23; ORF24; ORF51; ORF60; ORF63 
15 ORF65- ORF66; ORF67; ORF83; ORF84; ORF86; ORF87; ORF125; ORF143; ORF144; ORF179 
ORF182- ORF184; ORF185; ORF187; ORF221; ORF252; ORF254;; ORF278; ORF279; ORF387 
ORF388; ORF397; ORF1048; ORF1049; ORF1050; ORF1 128; ORF1130; ORF1 131 and one of the,r 

representative fragments. 

Also forming part of the invention are polypeptides encoded by the polynucleotides of 
20 the invention, as well as fusion polypeptides comprising such polypeptides. In one embodiment, the 
polypeptides and fusion polypeptides immunoreact with seropositive serum of an individual mfected 
with Chlamydia pneumoniae. For example, described below, are polypeptide sequences exmbnmg 
particularly preferable characteristics. For each group of preferred polypeptides described below, ,t u 
to be understood that in addition to the individual polypeptides listed, in instances wherem such 
25 polypeptides are encoded as part of "combined" ORFs, such "combined" polypeptides are also to be 

included within the preferred group. 

The subject of the invention is also a polypeptide according to the invention, 
characterized in that it is a polypeptide of the cellular envelope, preferably of the outer cellular 
envelope, of Chlamydia pneumoniae or one of its representative fragments. Accord.ng to the 
30 invention, the said polypeptide is preferably chosen from the polypeptides having the followmg 
sequences: 

SEQ ID No 15; SEQ ID No. 25; SEQ ID No. 26; SEQ ID No. 27; SEQ ID No. 28; SEQ ID No. 29; 
SEQ ID No 30; SEQ ID No. 31; SEQ ID No. 32; SEQ ID No. 33; SEQ ID No. 35; SEQ ID No. 68; 
SEQ ID No 124; SEQ ID No. 275; SEQ ID No. 291; SEQ ID No. 294; SEQ ID No. 327; SEQ ID 
35 No 342- SEQ ID No.364; SEQ ID No. 374; SEQ ID No. 380; SEQ ID No. 414; SEQ ID No. 439; 
SEQ ID No 466; SEQ ID No. 467; SEQ ID No. 468; SEQ ID No. 469; SEQ ID No. 470; SEQ ID 
No 472- SEQ ID No. 474; SEQ ID No. 476; SEQ ID No. 477; SEQ ID No. 478; SEQ ID No. 479; 



WO 99/27105 PCT/IB98/01890 

31 

SEQID No. 480; SEQID No. 482; SEQID No. 485; SEQID No. 500; SEQID No. 501; 

SEQ ID No. 503; SEQ ID No. 504; SEQ ID No. 505; SEQ ID No. 506; SEQ ID No. 520; SEQ ID 

No. 578; SEQ ID No. 580; SEQ ID No. 581; SEQ ID No. 595; SEQ ID No. 596; SEQ ID No. 597; 

SEQ ID No. 737; SEQ ID No. 830; SEQ ID No. 834; SEQ ID No. 836; SEQ ID No. 893; SEQ ID 
5 No. 917; SEQ ID No. 932; SEQ ID No. 976; SEQ ID No. 1035; SEQ ID No. 1045; SEQ ID No. 1090 

and one of their representative fragments. 

Preferably, the invention relates to a polypeptide according to the invention, 

characterized in that it is a Chlamydia pneumoniae transmembrane polypeptide or one of its 

representative fragments, having between 1 and 3 transmembrane domains, and in that it is chosen 
1 0 from the polypeptides having the following sequences: 

SEQID No. 2; SEQID No. 3; SEQID No. 6; SEQID No. 9; SEQID No. 10; SEQID No. 11; 

SEQ ID No. 13; SEQ ID No. 14; SEQ ID No. 16; SEQ ID No. 18; SEQ ID No. 19; SEQ ID No. 20; 

SEQID No. 21; SEQID No. 22; SEQID No. 25; SEQID No. 27; SEQID No. 28; SEQID 

No. 29; SEQID No. 30; SEQID No. 31; SEQID No. 32; SEQID No. 33; SEQID No. 34; 
15 SEQID No. 35; SEQID No. 37; SEQID No. 39; SEQID No. 41; SEQID No. 42; SEQID 

No. 44; SEQID No. 45; SEQID No. 46; SEQID No. 47; SEQID No. 48; SEQID No. 49; 

SEQID No. 50; SEQID No. 53; SEQID No. 54; SEQID No. 56; SEQID No. 57; SEQID 

No. 59; SEQID No. 60; SEQID No. 61; SEQID No. 62; SEQID No. 63; SEQID No. 64; 

SEQID No. 65; SEQID No. 66; SEQID No. 69;; SEQID No. 72; SEQID No. 73; SEQID 
20 No. 74; SEQID No. 76; SEQID No. 77; SEQID No. 78; SEQID No. 79; SEQID No. 80; 

SEQID No. 82; SEQID No. 84; SEQID No. 85; SEQID No. 86; SEQID No. 88; SEQID 

No. 89; SEQID No. 90; SEQID No. 91; SEQID No. 92; SEQID No. 93; SEQID No. 95; 

SEQID No. 96; SEQ ID No. 98; SEQID No. 99; SEQID No. 100; SEQID No. 101; SEQID 

No. 102; SEQ ID No. 103; SEQ ID No. 104; SEQ ID No. 105; SEQ ID No. 106; SEQ ID No. 107; 
25 SEQID No. 108; SEQID No. 114; SEQID No. 117; SEQID No. 118; SEQID No. 122; SEQID 

No. 123; SEQ ID No. 124; SEQ ID No. 125; SEQ ID No. 129; SEQ ID No. 130; SEQ ID No. 131; 

SEQ ID No. 132; SEQ ID No. 133; SEQ ID No. 134; SEQ ID No. 135; SEQ ID No. 137; SEQ ID 

No. 138; SEQ ID No. 139; SEQ ID No. 140; SEQ ID No. 141; SEQ ID No. 142; SEQ ID No. 143; 

SEQ ID No. 145; SEQ ID No. 146; SEQ ID No. 147; SEQ ID No. 150; SEQ ID No. 151; SEQ ID 
30 No. 152; SEQ ID No. 156; SEQ ID No. 157; SEQ ID No. 158; SEQ ID No. 159; SEQ ID No. 160; 

SEQ ID No. 161; SEQ ID No. 162; SEQ ID No. 164; SEQ ID No. 166; SEQ ID No. 167; SEQ ID 

No. 170; SEQ ID No. 173; SEQ ID No. 175; SEQ ID No. 176; SEQ ID No. 178; SEQ ID No. 179; 

SEQ ID No. 180; SEQ ID No. 182; SEQ ID No. 183; SEQ ID No. 184; SEQ ID No. 185; SEQ ID 

No. 186; SEQ ID No. 187; SEQ ID No. 188; SEQ ID No. 189; SEQ ID No. 190; SEQ ID No. 191; 
35 SEQ ID No. 192; SEQ ID No: 194; SEQ ID No. 195; SEQ ID No. 196; SEQ ID No. 197; SEQ ID 

No. 198; SEQ ID No. 199; SEQ ID No. 200; SEQ ID No. 201; SEQ ID No. 202; SEQ ID No. 205; 

SEQ ID No. 207; SEQ ID No. 208; SEQ ID No. 209; SEQ ID No. 210; SEQ ID No. 212; SEQ ID 
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No 215- SEQID No. 2,9; SEQID No. 220; SEQID No. 224; SEQID No. 226; SEQID 
No 227^ SEQ ID No. 228; SEQ ID No. 231; SEQID No. 232; SEQ ID No. 233; SEQ ID No. 234; 
SEQ ID No. 235; SEQ ID No.236;SEQID No. 238; SEQID No. 239; SEQ ID No.240;SE Q m 
NO 24,;SEQID No.242;SEQlD No.244;SEQID No.247;SEQID N„.25.;SEQID No. 25* 
, SEQID No.253.SEQU> No.255;SEQID No.256;SEQID N„.257;SEQD3 No. 258; SEQ ID 
No 260; SEQ ID No.262; SEQID No.263;SEQm No.266;SEQ>D No.267;SEQK> N.268; 
«» No*,; SEQ* No.270;SEQE3 No. 273; SEQ ID N„.2,4;SEQID No^; SEQID 
No 278- SEQ ID N„.279;SEQID N„.280;SEQ.D No.28,;SEQ,D N„.282;SEQ,D No. 283. 
SEQ ID No 284; SEQ ID No. 286; SEQ ID No. 287; SEQ ID No. 289; SEQ ID No. 290; SEQ U> 
1. No 291; SEQ ID No.293;SEQID N„.294; SEQID No. 297; SEQ ID No.304.SEQD No 305; 
SEQ ID No 307; SEQ ID N..308; SEQID No.309;SE Q m No.310;SEQ,D No.31.;SEQID 
No 313- SEQ ID No.3,4;SEQID No.3,5;SEQID No. 316; SEQID N..3 18 ;SEQ1D No^3I9; 
SEQ ID No. 320; SEQ ID No.32,;SEQ,D No.322;SEQ,D N„.323;SEQID No324;SEQ03 
No 325' SEQ ID No.326; SEQID N„.33,;SEQ1D N„.332;SEQ1D No. 336; SEQ ID No.338 
15 SEQ ID No. 339; SEQ ID No.34.;SEQID No.344;SEQID N„.345;SEQID 

No 350; SEQID N0.352..SEQ.D No.353;SE Q m No.356;SEQID N„.357;SEQID N„.3» 
SEQID No. 359; SEQID N„.360.SEQm No. 362; SEQ ID No.365;SEQID 
No 367- SEQ ID No. 370; SEQ ID No. 372; SEQ ID No. 373; SEQ ID No. 376; SEQ ID No^7; 
SEQ ID No 378; SEQ ID No. 379; SEQ ID N„.38,.SEQID No. 382; SEQ*, 
2 0 No 384; SEQ ID No. 385; SEQ ID No. 386; SEQ ID No. 387; SEQ ID No.390; S EQID No.3,2. 
SEQ ID No 393; SEQ ID N„.3,4;SEQ,D No. 396; SEQ ID No. 398; SEQ ID ******* 
No 400- SEQ ID No. 404; SEQ ID No. 408; SEQ ID No. 410; SEQ ID No. 411; SEQID No^3. 

«JU«.> N0.4„;SEQn, N„.4,8;SEQ03 No.420.SEQn, 
No 424; SEQ ID No. 427; SEQID No. 428; SEQ ID No. 429; SEQ ID No. 430; SEQ ID No^3; 
25 SEQID No. 433; SEQ ID No. 434; SEQ ID No. 437; SEQ ID No. 440; SEQ ID No. 441; SEQ ID 
NO 442; SEQID No.443;SEQID No.444;SEQ>D No. 445; SEQ E> No.44,;SEQID No. 450; 
SEQID No. 451; SEQID No. 452; SEQ ID No. 455; SEQ ID No. 456; SEQ ID IkAW 
No 460; SEQ ID N„.46,;SEQID N„.462;SE Q n3 N„.463;SEQn> N..464;SEQn> No.465; 
SEQID N„.467;SEQID No. 469; SEQ ID N„.471;SEQID N„.474;SEQID No. 4,5; SEQ ID 
30 No 476; SEQ ID No.477;SEQID No. 479; SEQ ID No. 482; SEQ ID No. 483; SEQ ID N„^484; 
SEQID No. 485; SEQ ID No. 486; SEQ ID No. 487; SEQ ID No. 488; SEQ ID No. 491; SEQ ID 
No 493- SEQ ID No. 494; SEQ ID No. 497; SEQ ID No. 498; SEQ ID No. 499; SEQ ID No. 503; 
SEQ ID No. 508; SEQ ID No. 509; SEQ ID No. 510; SEQID No. 512; SEQID No.5.4;SEQID 
No 5.5- SEQID No.5,6;SEQID No. 5,7; SEQ ID No. 518; SEQID No. 520; SEQ ID No. 521; 
35 SEQID No. 523; SEQ ID No,525;SEQID No.527; SEQID No 528; SEQ ID No. 529; SEQ ID 
No 530. SEQ ID No. 531; SEQ ID No. 533; SEQ ID No. 534; SEQ ID No. 535; SEQ ID No. 536; 
SEQ tt>' No 537; SEQ ID No.540;SEQID No.541;SEQID No. 543; SEQ ID No. 544; SEQ ID 
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No. 545; SEQID No. 546; SEQID No. 548; SEQ ID No. 549; SEQID No. 551; SEQID 
No. 553; SEQ ID No. 554; SEQ ID No. 555; SEQ ID No. 556; SEQ ID No. 557; SEQ ID No. 558; 
SEQ ID No. 559; SEQ ID No. 560; SEQ ID No. 562; SEQ ID No. 563; SEQ ID No. 564; SEQ ID 
No. 565; SEQ ID No. 566; SEQ ID No. 569; SEQ ID No. 571; SEQ ID No. 573; SEQ ID No. 576; 
5 SEQ ID No. 577; SEQ ID No. 581; SEQ ID No. 583; SEQ ED No. 584; SEQ ED No. 585; SEQ ID 
No. 586; SEQ ED No. 588; SEQ ED No. 591; SEQ ED No. 592; SEQ ED No. 594; SEQ ID No. 595; 
SEQ ID No. 596; SEQ ID No. 597; SEQ ID No. 599; SEQ ID No. 600; SEQ ID No. 603; SEQ ID 
No. 605; SEQ ID No. 608; SEQ ID No. 614; SEQ ID No. 615; SEQ ID No. 620; SEQ ID No. 621; 
SEQ ID No. 622; SEQ ID No. 623; SEQ ID No. 624; SEQ ID No. 625; SEQ ID No. 629; SEQ ID 

10 No. 630; SEQ ID No. 631 ; SEQ ID No. 633; SEQ ID No. 634; SEQ ID No. 637; SEQ ID No. 642; 
SEQ ID No. 644; SEQ ID No. 645; SEQ ID No. 647; SEQ ID No. 648; SEQ ID No. 652; SEQ ID 
No. 654; SEQ ID No. 655; SEQ ID No. 657; SEQ ID No. 658; SEQ ID No. 659; SEQ ID No. 660; 
SEQ ID No. 661; SEQ ID No. 664; SEQ ID No. 665; SEQ ED No. 666; SEQ ID No. 667; SEQ ID 
No. 670; SEQ ID No. 671 ; SEQ ID No. 672; SEQ ID No. 673; SEQ ID No. 674; SEQ ID No. 676; 

15 SEQID No. 679; SEQ ID No. 681; SEQ ID No. 684; SEQ ID No. 687; SEQ ID No. 688; SEQ ID 
No. 689; SEQ ID No. 690; SEQ ED No. 693; SEQ ID No. 694; SEQ ID No. 695; SEQ ID No. 696; 
SEQID No. 697; SEQ ID No. 698; SEQ ID No. 699; SEQ ID No. 700; SEQ ID No. 701; SEQ ID 
No. 703; SEQ ID No. 705; SEQ ID No. 706; SEQ ID No. 707; SEQ ID No. 708; SEQ ID No. 710; 
SEQID No. 712; SEQ ID No. 715; SEQ ID No. 716; SEQID No. 717; SEQ ID No. 718; SEQ ID 

20 No. 719; SEQ ID No. 721; SEQ ID No. 722; SEQ ID No. 723; SEQ ID No. 725; SEQ ID No. 726; 
SEQID No. 727; SEQ ID No. 728; SEQ ID No. 729; SEQ ID No. 730; SEQ ID No. 731; SEQ ID 
No. 733; SEQ ID No. 736; SEQ ID No. 737; SEQ ID No. 738; SEQ ID No. 740; SEQ ID No. 741; 
SEQID No. 742; SEQ ID No. 743; SEQ ID No. 747; SEQ ID No. 748; SEQ ID No. 750; SEQ ID 
No. 752; SEQ ED No. 754; SEQ ID No. 755; SEQ ID No. 756; SEQ ED No. 757; SEQ ID No. 759; 

25 SEQID No. 760; SEQ ID No. 761; SEQ ID No. 762; SEQ ID No. 763; SEQ ID No. 764; SEQ ID 
No. 765; SEQ ID No. 766; SEQ ID No. 767; SEQ ID No. 768; SEQ ID No. 772; SEQ ID No. 774; 
SEQID No. 775; SEQ ID No. 777; SEQ ID No. 781; SEQ ID No. 783; SEQ ED No. 788; SEQ ID 
No. 791; SEQ ID No. 792; SEQ ID No. 793; SEQ ID No. 794; SEQ ID No. 795; SEQ ID No. 796; 
SEQ ID No. 797; SEQ ID No. 798; SEQ ID No. 799; SEQ ED No. 802; SEQ ID No. 803; SEQ ID 

30 No. 806; SEQ ID No. 807; SEQ ID No. 808; SEQ ID No. 809; SEQ ID No. 810; SEQ ID No. 81 1; 
SEQID No. 813; SEQ ID No. 814; SEQ ID No. 815; SEQ ID No. 816; SEQID No. 817; SEQ ID 
No. 819; SEQ ID No. 820; SEQ ID No. 821; SEQ ID No. 823; SEQ ID No. 824; SEQ ID No. 827; 
SEQ ID No. 829; SEQ ID No. 830; SEQ ID No. 831; SEQ ID No. 833; SEQ ID No. 834; SEQ ID 
No. 835; SEQ ID No. 837; SEQ ID No. 844; SEQ ID No. 845; SEQ ID No. 846; SEQ ID No. 847; 

35 SEQ ID No. 848; SEQ ID No. 849; SEQ ID No. 850; SEQ ID No. 851; SEQ ID No. 852; SEQ ID 
No. 854; SEQ ID No. 855; SEQ ID No. 856; SEQ ID No. 857; SEQ ID No. 859; SEQ ID No. 860; 
SEQ ID No. 862; SEQ ID No. 865; SEQ ID No. 866; SEQ ID No. 868; SEQ ID No. 869; SEQ ID 



PCT/1B98/01890 

WO 99/27105 

34 

No 870- SEQID No. 871; SEQID No. 872; SEQID No.874; SEQID No. 877; SEQID 
No 878; SEQID No. 879; SEQID No. 880; SEQID No. 881; SEQID No. 882; SEQID No. 884; 
SEQ ID No. 885; SEQ ID No. 888; SEQ ID No. 889; SEQ ID No. 890; SEQ ID No. 891; SEQ ID 
No 892- SEQ ID No. 894; SEQ ID No. 895; SEQ ID No. 896; SEQ ID No. 897; SEQ ID No. 899; 
5 SEQ ID No. 900; SEQ ID No. 902; SEQ ID No. 903; SEQ ID No. 904; SEQ ID No. 905; SEQ ID 
No 909- SEQ ID No. 910; SEQ ID No. 912; SEQ ID No. 913; SEQ ID No. 914; SEQ ID No. 915; 
SEQID No.917; SEQID No.918; SEQID No. 919; SEQ ID Na 921; SEQ ID No. 923; SEQ ID 
No 924; SEQ ID No. 926; SEQ ID No. 927; SEQ ID No. 928; SEQ ID No. 929; SEQ ID No. 930; 
SEQID No 931; SEQ ID No. 937; SEQ ID No. 938; SEQ ID No. 939; SEQ ID No. 941; SEQID 
10 No 943- SEQ ID No. 948; SEQ ID No. 951; SEQ ID No. 952; SEQ ID No.953; SEQID No. 958; 
SEQ ID No 960; SEQ ID No. 963; SEQ ID No. 964; SEQ ID No. 965; SEQ ID No. 968; SEQ ID 
No 970- SEQ ID No. 974; SEQ ID No. 975; SEQ ID No. 977; SEQ ID No. 979; SEQ ID No. 980; 
SEQID No 981; SEQID No. 983; SEQ ID No. 984; SEQ ID No. 985; SEQ ID No. 987; SEQ ID 
No 989- SEQ ID No. 992; SEQ ID No. 993; SEQ ID No. 997; SEQ ID No. 998; SEQ ID No. 999 
15 SEQID No. 1001; SEQID No. 1002; SEQ ID No. 1004; SEQID No. 1005; SEQ ID No. 1009 
SEQID No. 1013; SEQID No. 1014; SEQ ID No. 1015; SEQ ID No. 1016; SEQ ID No. 1019 
SEQID No 1021; SEQID No. 1023; SEQID No. 1024; SEQID No. 1029; SEQID No. 1031 
SEQID No. 1033; SEQID No. 1034; SEQID No. 1039; SEQID No. 1041; SEQID No. 1042 
SEQID No. 1045; SEQID No. 1047; SEQ ID No. 1049; SEQ ID No. 1051; SEQ ID No. 1052 
20 SEQID No. 1053; SEQID No. 1054; SEQID No. 1056; SEQID No. 1059; SEQID No. 1061 
SEQID No 1062; SEQID No. 1063; SEQID No. 1064; SEQ ID No. 1065; SEQID No. 1067 
SEQID No. 1075; SEQID No. 1077; SEQ ID No. 1078; SEQ ID No. 1079; SEQ ID No. 1080 
SEQID No. 1081; SEQID No. 1089; SEQ ID No. 1095; SEQ ID No. 1097; SEQ ID No. 1098 
SEQID No. 1099; SEQID No. 1101; SEQ ID No. 1102; SEQ ID No. 1 103; SEQ ID No. 1106 
25 SEQID No. 1107; SEQID No. 1 108; SEQ ID No. 1 109; SEQ ID No. 1 1 10; SEQ ID No. 1113 
SEQID No. 1116; SEQID No. 1118; SEQID No. 1119; SEQID No. 1121; SEQ ID No. 1123 
SEQID No. 1124; SEQID No. 1126; SEQ ID No. 1 128; SEQ ID No. 1130; SEQID No. 1131 
SEQID No. 1133; SEQID No. 1134; SEQID No. 1136; SEQID No. 1137 and one of their 
representative fragments. 

30 Preferably, the invention relates to a polypeptide according to the invention, 

characterized in that it is a Chlamydia pneumoniae transmembrane polypeptide or one of its respective 
fragments, having between 4 and 6 transmembrane domains, and in that it is chosen from the 
polypeptides having the following sequences: 

SEQID No. 5; SEQ ID No. 7; SEQ ID No. 8; SEQ ID No. 15; SEQID No. 36; SEQ ID No. 38; 
35 SEQID No. 51; SEQID No. 55; SEQID No. 58; SEQID No. 67; SEQID No. 70; SEQID 
No 81; SEQID No. 97; SEQID No. 110; SEQID No. Ill, SEQID No. 115; SEQID No. 119; 
SEQ ID No. 126; SEQ ID No. 128; SEQ ID No. 148; SEQ ID No. 155; SEQ ID No. 163; SEQ ID 
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No. 165; SEQID No. 168; SEQID No. 169; SEQID No. 171; SEQID No. 172; SEQID 
No. 174; SEQ ID No. 177; SEQ ID No. 181; SEQ ID No. 193; SEQ ID No. 203; SEQ ID No. 213; 
SEQID No. 214; SEQ ID No. 216; SEQ ID No. 217; SEQ ID No. 221; SEQ ID No. 222; SEQ ID 
No. 225; SEQ ID No. 229; SEQ ID No. 243; SEQ ID No. 246; SEQ ID No. 248; SEQ ID No. 254; 
5 SEQID No. 261; SEQ ID No. 285; SEQ ID No. 288; SEQ ID No. 292; SEQ ID No. 296; SEQ ID 
No. 298; SEQ ID No. 299; SEQ ID No. 301; SEQ ID No. 303; SEQ ID No. 317; SEQ ID No. 328; 
SEQ ID No. 329; SEQ ID No. 351; SEQ ID No. 354; SEQ ID No. 355; SEQ ID No. 364; SEQ ID 
No. 371; SEQ ID No. 374; SEQ ID No. 375; SEQ ID No. 391; SEQ ID No. 395; SEQ ID No. 401; 
SEQ ID No. 403; SEQ ID No. 405; SEQ ID No. 409; SEQ ID No. 414; SEQ ID No. 419; SEQ ID 

10 No. 421; SEQ ID No. 423; SEQ ID No. 425; SEQ ID No. 438; SEQ ID No. 448; SEQ ID No. 453; 
SEQ ID No. 458; SEQ ID No. 466; SEQ ID No. 468; SEQ ID No. 470; SEQ ID No. 480; SEQ ID 
No. 489; SEQ ID No. 490; SEQ ID No. 496; SEQ ID No. 501; SEQ ID No. 504; SEQ ED No. 505; 
SEQ ID No. 506; SEQ ED No. 51 1; SEQ ED No. 513; SEQ ED No. 519; SEQ ED No. 526; SEQ ED 
No. 532; SEQ ED No. 538; SEQ ID No. 539; SEQ ED No. 547; SEQ ID No. 550; SEQ ID No. 561; 

15 SEQ ED No. 568; SEQ ED No. 570; SEQ ED No. 574; SEQ ID No. 578; SEQ ED No. 579; SEQ ID 
No. 580; SEQ ID No. 582; SEQ ID No. 589; SEQ ID No. 593; SEQ ED No. 598; SEQ ID No. 601; 
SEQ ID No. 604; SEQ ID No. 610; SEQ ID No. 613; SEQ ID No. 617; SEQ ID No. 626; SEQ ID 
No. 632; SEQ ID No. 635; SEQ ID No. 638; SEQ ID No. 640; SEQ ID No. 641; SEQ ID No. 646; 
SEQ ID No. 649; SEQ ID No. 650; SEQ ED No. 651; SEQ ID No. 686; SEQ ID No. 71 1 ; SEQ ED 

20 No. 724; SEQ ID No. 732; SEQ ID No. 734; SEQ ID No. 744; SEQ ID No. 745; SEQ ID No. 749; 
SEQID No. 751; SEQ ID No. 769; SEQ ID No. 770; SEQ ID No. 771; SEQ ID No. 773; SEQ ID 
No. 776; SEQ ED No. 779; SEQ ID No. 780; SEQ ID No. 785; SEQ ID No. 787; SEQ ID No. 789; 
SEQ ID No. 801; SEQ ID No. 805; SEQ ID No. 812; SEQ ED No. 822; SEQ ID No. 825; SEQ ID 
No. 826; SEQ ED No. 839; SEQ ID No. 841; SEQ ED No. 843; SEQ ED No. 853; SEQ ED No. 861; 

25 SEQ ID No. 875; SEQ ID No. 876; SEQ ID No. 886; SEQ ED No. 893; SEQ ED No. 898; SEQ ED 
No. 906; SEQ ID No. 907; SEQ ID No. 908; SEQ ID No. 920; SEQ ID No. 922; SEQ ID No. 925; 
SEQ ID No. 933; SEQ ID No. 935; SEQ ED No. 936; SEQ ED No. 944; SEQ ED No. 946; SEQ ED 
No. 947; SEQ ID No. 954; SEQ ID No. 959; SEQ ID No. 961; SEQ ID No. 966; SEQ ID No. 967; 
SEQ ID No. 972; SEQ ID No. 978; SEQ ID No. 995; SEQ ID No. 996; SEQ ID No. 1000; SEQ ID 

30 No. 1003; SEQID No. 1010; SEQID No. 1011; SEQID No. 1012; SEQID No. 1017; SEQID 
No. 1020; SEQID No. 1030; SEQID No. 1036; SEQID No. 1038; SEQID No. 1043; SEQID 
No. 1046; SEQID No. 1048; SEQID No. 1050; SEQID No. 1058; SEQID No. 1071; SEQID 
No. 1073; SEQ ED No. 1084; SEQ ID No. 1085; SEQ ID No. 1086; SEQ ID No. 1087; SEQ ID 
No. 1091; SEQ ED No. 1092; SEQ ED No. 1094; SEQ ED No. 1096; SEQID No. 1100; SEQID 

35 No. 1 104; SEQ ID No. 1 1 1 1 ; SEQ ID No. 1 1 12; SEQ ID No. 11 14; SEQ ID No. 1 1 17; SEQ ID 
No. 1 122; SEQ ID No. 1 125 and one of their representative fragments. 

Preferably, the invention relates to a polypeptide according to the invention, 
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characterized in that it is a Chlamydia pneumoniae transmembrane polypeptide or one 
of its representative fragments, having at least 7 transmembrane domains, and in that it is chosen from 
the polypeptides having the following sequences: 

SEQ ID No. 17; SEQ ID No. 52; SEQ ID No. 68; SEQ ID No. 83; SEQ ID No. 87; SEQ ID No. 109; 
5 SEQ ID No 112, SEQ ID No. 113; SEQ ID No. 120; SEQ ID No. 121; SEQ ID No. 127; SEQ ID 
No 153- SEQ ID No.204;SEQID No. 211; SEQ ID No.218;SEQID No.223;SEQID No.275; 
SEQ ID No 277; SEQ ID No. 295; SEQ ID No. 300; SEQ ID No. 302; SEQ ID No. 306; SEQ ID 
No 327- SEQ ID No. 335; SEQ ID No. 342; SEQ ID No. 343; SEQ ID No. 347; SEQ ID No. 349; 
SEQ ID' No. 361; SEQ ID No. 363; SEQ ID No. 369; SEQ ID No. 380; SEQ ID No. 388; SEQ ID 
10 No 389- SEQ ID No. 397; SEQ ID No. 415; SEQ ID No. 432; SEQ ID No. 439; SEQ ID No. 446; 
SEQ ID' No 449; SEQ ID No. 472; SEQ ID No.478;SEQID No.500;SEQID No.522;SEQID 
No 524- SEQ ID No. 567; SEQ ID No. 575; SEQ ID No. 602; SEQ ID No. 606; SEQ ID No. 609; 
SEQ ID' No. 636; SEQ ID No. 639; SEQ ID No.643;SEQID No.653;SEQID No.668;SEQID 
No 692- SEQ ID No. 702; SEQ ID No. 704; SEQ ID No. 713; SEQ ID No. 720; SEQ ID No. 778; 
15 SEQ ID' No. 784; SEQ ID No.800;SEQID No.836;SEQID No.838;SEQID No.842;SEQID 
No 864- SEQ ID No. 867; SEQ ID No.883;SEQID No. 901; SEQ ID No.916;SEQID No. 932; 
SEQ Id' No 934; SEQ ID No. 940; SEQ ID No. 942; SEQ ID No. 950; SEQ ID No. 956; SEQ ID 
No 971- SEQ ID No. 973; SEQ ID No. 976; SEQ ID No. 988; SEQ ID No. 994; SEQ ID No. 1018 
SEQ K)' No. 1028; SEQ ID No. 1035; SEQ ID No. 1037; SEQ ID No. 1044; SEQ ID No. 1055 
20 SEQ ID No.l057;SEQID No. 1068; SEQ ID No. 1069; SEQ ID No. 1070; SEQ ID No. 107, 
SEQ ID No.l082;SEQID No. 1088; SEQ ID No. 1105; SEQ ID No. 1132; SEQ ID No.ll35and 
one of their representative fragments. 

Preferably, the invention relates to a polypeptide according to the invention, m that it is a 
Chlamydia pneumoniae surface exposed polypeptide or one of its representative fragments, and in that 
25 it is chosen from the polypeptides having the following sequences: 

SEQ ID No. ,5, SEQ ID No. 25, SEQ ID No. 26, SEQ ID No. 27, SEQ ID No. 28, SEQ ID No. 29 
SEQ ID No 30, SEQ ID No. 31, SEQ ID No. 32, SEQ ID No. 33, SEQ ID No. 35, SEQ ID No. 36, 
SEQ ID No. 1257, SEQ ID No. 280, SEQ ID No. 291, SEQ ID No. 314, SEQ ID No. 354, SEQ ID 
No 380 SEQ ID No. 1266, SEQ ID No. 466, SEQ ID No. 467, SEQ ID No. 468, SEQ ID No. 469, 
30 SEQ ID No. 470, SEQ ID No. 472, SEQ ID No. 474, SEQ ID No. 476, SEQ ID No. 477, SEQ ID No. 
478 SEQ ID No. 479, SEQ ID No. 480, SEQ ID No. 482, SEQ ID No. 483, SEQ ID No. 485, SEQ ID 
No '486 SEQ ID No. 500, SEQ ID No. 501, SEQ ID No. 503, SEQ ID No. 504, SEQ ID No. 505, 
SEQ ID No 506, SEQ ID No. 507, SEQ ID No. 1268, SEQ ID No. 1269, SEQ ID No. 543, SEQ ID 
No 544 SEQ ID No. 578, SEQ ID No. 579, SEQ ID No. 580, SEQ ID No. 581, SEQ ID No. 595, 
35 SEQ ID No 596, SEQ ID No. 597, SEQ ID No. 1271, SEQ ID No. 633, SEQ ID No. 637, SEQ ID 
No 699 SEQ ID No. 706, SEQ ID No. 737, SEQ ID No. 744, SEQ ID No. 1273, SEQ ID No. 751, 
SEQ ID No 775, SEQ ID No. 776, SEQ ID No. 777, SEQ ID No. 793, SEQ ID No. 815, SEQ ID No. 
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830, SEQ ID No. 1221, SEQ ID No. 849, SEQ ID No. 851, SEQ ID No. 852, SEQ ID No. 874, SEQ 
ID No. 891, SEQ ID No. 922, SEQ ID No. 940, SEQ ID No. 1231, SEQ ID No. 1281, SEQ ID No. 
1035, SEQ ID No. 1079, SEQ ID No. 1087, SEQ ID No. 1108, and one of their representative 
fragments. 

5 Preferably, the invention relates to a polypeptide according to the invention, 

characterized in that it is a Chlamydia pneumoniae lipoprotein or one of its representative fragments, 
and in that it is chosen from the polypeptides having the following sequences: 

SEQ ID No. 3, SEQ ID No. 10, SEQ ID No. 11, SEQ ID No. 16, SEQ ID No. 1254, SEQ ID No. 
1255, SEQ ID No. 38, SEQ ID No. 1256, SEQ ID No. 62, SEQ ID No. 85, SEQ ID No. 1258, SEQ ID 

10 No. 115, SEQ ID No. 1151, SEQ ID No. 151, SEQ ID No. 1259, SEQ ID No. 173, SEQ ID No. 1261, 
SEQ ID No. 186, SEQ ID No. 194, SEQ ID No. 205, SEQ ID No. 214, SEQ ID No. 216, SEQ ID No. 
217, SEQ ID No. 238, SEQ ID No. 1 177, SEQ ID No. 280, SEQ ID No. 291, SEQ ID No. 317, SEQ 
ID No. 327, SEQ ID No. 354, SEQ ID No. 364, SEQ ID No. 367, SEQ ID No. 414, SEQ ID No. 432, 
SEQ ID No. 1 192, SEQ ID No. 460, SEQ ID No. 1267, SEQ ID No. 1268, SEQ ID No. 520, SEQ ID 

15 No. 536, SEQ ID No. 1270, SEQ ID No. 576, SEQ ID No. 597, SEQ ID No. 603, SEQ ID No. 609, 
SEQ ID No. 637, SEQ ID No. 1272, SEQ ED No. 652, SEQ ID No. 1213, SEQ ID No. 699, SEQ ID 
No. 705, SEQ ID No. 706, SEQ ID No. 708, SEQ ID No. 711, SEQ ID No. 727, SEQ ID No. 1274, 
SEQ ID No. 800, SEQ ID No. 814, SEQ ID No. 825, SEQ ID No. 829, SEQ ID No. 830, SEQ ID No. 

831, SEQ ID No. 844, SEQ ID No. 849, SEQ ID No. 1275, SEQ ID No. 1276, SEQ ID No. 1277, SEQ 
20 ID No. 872, SEQ ID No. 878, SEQ ID No. 880, SEQ ID No. 891, SEQ ID No. 892, SEQ ID No. 1278, 

SEQ ID No. 1279, SEQ ID No. 1280, SEQ ID No. 941, SEQ ID No. 942, SEQ ID No. 1282, SEQ ID 
No. 1283, SEQ ID No. 952, SEQ ID No. 988, SEQ ID No. 998, SEQ ID No. 1009, SEQ ID No. 1285, 
SEQ ID No. 1235, SEQ ID No. 1028, SEQ ID No. 1056, SEQ ED No. 1070, SEQ ID No. 1287, SEQ 
ID No. 1087, SEQ ID No. 1288, SEQ ID No. 1289, SEQ ID No. 1098, SEQ ID No. 1246, SEQ ID No. 

25 1291, SEQ ID No. 1108, SEQ ID No. 1109, SEQ ID No. 1112, SEQ ID No. 1133, and one of their 
representative fragments. 

Preferably, the invention relates to a polypeptide according to the invention, in that it is a 
Chlamydia pneumoniae polypeptide involved in lipopolysaccharide (LPS) biosynthesis, and in that it 
is chosen from the polypeptides having the following sequences: 

30 SEQ ID No. 316, SEQ ID No. 564, SEQ ID No. 610, SEQ ID No. 647, SEQ ID No. 1211, SEQ ID 
No. 688, SEQ ID No. 924, and one of their representative fragments. 

Preferably, the invention relates to additional LPS-related polypeptides according to the 
invention, in that it is: 

(a) a Chlamydia pneumoniae KDO (3-deoxy-D-manno-octylosonic acid)-related 
35 polypeptide or one of its representative fragments, and in that it is chosen from the polypeptides 
having the following sequences: SEQ ID No. 177, SEQ ED No. 1 156, SEQ ID No. 245, SEQ ID No. 
767, and one of their representative fragments; 
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(b) a Chlamydia pneumoniae phosphomannomutase-related polypeptide or one 
of its representative fragments, and in that it is chosen from the polypeptides having the foiling 
sequences: SEQ ID No. 74, and its representative fragment; 

(c) a Chlamydia pneumoniae phosphoglueomutase-related polypeptide or one of .ts 
S representative fragments, and in that it is chosen from the polypeptides having the following 

sequences- SEQ ID No. 1286, SEQ ID No. 1039, and its representative fragment; and 

(d) a Chlamydia pneumoniae lipid A component-related polypeptide or one of .ts 
representative fragments, and in that it is chosen from the polypeptides having the foiling 
sequences: SEQ ID No. 689. SEQ ID No. 690, SEQ ID No. 691, SEQ ID No. 1037, and one of therr 

X0 repreSentatlVe J^" y U the . nvention relates t0 a polypeptide according to the invention, in that it is a 
Chlamydia pneumoniae polypeptide or one of its representative fragments that contains an RGD 
seqU ence and is also an outer membrane protein, and in that it is chosen from the polypepudes havmg 
the following sequences: SEQ ID No. 468 and its representative fragment, 

Preferably, the invention relates to a polypeptide according to the mventoon, m that « 
Chlamydia pneumoniae polypeptide or one of its representative fragments that contains an ROD 

and in that it is chosen from the polypeptides having the following sequences: 
SEO ID No. 350 and its representative fragments. 
2 o Preferably, the invention relates to a polypeptide according to the invention, m that „ .s a 

Chlamydia pneumoniae polypeptide or one of its representative fragments that is cysteine rich and 
clns ROD sequence, and in that it is chosen from the polypeptides having the following 
sequences: SEQ ID No. 1290, SEQ ID No. 6846, SEQ ID No. 6848, and one of their representee 
fragments. ^ ^ ^ ^ ^ ^ ^ in tnat it is a 

Chlamydia pneumoniae outer membrane polypeptide that contains cysteines in their first 30 ami™ 
acids and also contain an RGD sequence, and in that it is chosen from the polypeptrdes havmg the 

30 Xeqidno.268,seqidno. 1 ^«mH.m«mH.^«^«Q 

IDNo 45 1 ,SEQ ro No.452,SEQIDNo.453,SEQIDNo.473,SEQIDNo.499,SEQIDNo^515, 
SEQ ID No 519, SEQ ID No. 525, SEQ ID No. 526, SEQ ID No. 538, SEQ ID No. 61 1, SEQ 
645^ SEQ ID No 68^ SEQ ID No. 700, SEQ ID No. 746, SEQ ID No. 755, SEQ ID No. 756, SE ^ ® 
No 757 SEQ ID No. 789, SEQ ID No. 814, SEQ ID No. 855, SEQ ID No. 856, SEQ ID No. 878, 
35 SEQ ID No. 957, SEQ ID No. 958, SEQ ID No. 989, SEQ ID No. 1290, and one of then 

representative fragments. 

Preferably, the invention relates to a polypeptide accordmg to the mvent.on, in that .t ,s 
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Chlamydia pneumoniae polypeptide or one of its representative fragments that contains RGD 
sequences homologous to Chlamydia trachomatis polypeptides containing RGD sequences, and in that 
it is chosen from the polypeptides having the following sequences: 

SEQ ID No. 1 14, SEQ ID No. 468, SEQ ID No. 755, SEQ ID No. 756, SEQ ID No. 757, SEQ ID No. 
5 855, SEQ ID No. 856, SEQ ID No. 905, SEQ ID No. 913, SEQ ID No. 914, SEQ ID No. 915, and one 
of their representative fragments. 

Preferably, the invention relates to a polypeptide according to the invention, in that it is a 
Chlamydia pneumoniae Type III and non-Type III secreted polypeptide or one of its representative 
fragments, and in that it is chosen from the polypeptides having the following sequences: 

1 0 SEQ ID No. 25, SEQ ID No. 28, SEQ ID No. 29, SEQ ID No. 33, SEQ ID No. 308, SEQ ID No. 309, 
SEQ ID No. 343, SEQ ID No. 344, SEQ ID No. 345, SEQ ID No. 367, SEQ ID No. 414, SEQ ID No. 
415, SEQ ID No. 480, SEQ ID No. 550, SEQ ID No. 579, SEQ ID No. 580, SEQ ID No. 581, SEQ ID 
No. 597, SEQ ID No. 699, SEQ ID No. 744, SEQ ID No. 751, SEQ ID No. 776, SEQ ID No. 866, 
SEQ ID No. 874, SEQ ID No. 883, SEQ ID No. 884, SEQ ID No. 888, SEQ ID No. 891, SEQ ID No. 

1 5 6845, and one of their representative fragments. 

Preferably, the invention relates to a polypeptide according to the invention, in that it is a 
Chlamydia pneumoniae cell wall anchored surface polypeptide or one of its representative fragments, 
and in that it is chosen from the polypeptides having the following sequences: 

SEQ ID No. 267, SEQ ID No. 271, SEQ ID No. 419, SEQ ID No. 590, SEQ ED No. 932, SEQ ID No. 
2 0 6844, SEQ ID No. 6847, and one of their representative fragments. 

Preferably, the invention relates to a polypeptide according to the invention, in that it is a 
Chlamydia pneumoniae polypeptide or one of its representative fragments not found in Chlamydia 
trachomatis (Blastp P>e" 10 ), and in that it is chosen from the polypeptides having the following 
sequences: 

25 SEQ ID No. 7, SEQ ID No. 8, SEQ ID No. 9, SEQ ID No. 16, SEQ ID No. 17, SEQ ID No. 18, SEQ 
ID No. 19, SEQ ID No. 20, SEQ ID No. 21, SEQ ID No. 22, SEQ ID No. 1254, SEQ ID No. 23, SEQ 
ID No. 1255, SEQ ID No. 24, SEQ ID No. 1 139, SEQ ID No. 1 140, SEQ ID No. 46, SEQ ID No. 47, 
SEQ ID No. 51, SEQ ID No. 60, SEQ ID No. 1256, SEQ ID No. 61, SEQ ID No. 62, SEQ ID No. 63, 
SEQ ID No. 64, SEQ ID No. 1257, SEQ ID No. 65, SEQ ID No. 66, SEQ ID No. 67, SEQ ID No. 68, 

30 SEQ ID No. 1143, SEQ ID No. 1145, SEQ ID No. 83, SEQ ID No. 84, SEQ ID No. 1146, SEQ ID 
No. 85, SEQ ID No. 86, SEQ ID No. 87, SEQ ID No. 1258, SEQ ID No. 1 16, SEQ ID No. 1 17, SEQ 
ID No. 125, SEQ ID No. 1 148, SEQ ID No. 143, SEQ ID No. 1 150, SEQ ID No. 1 15 1, SEQ ID No. 
144, SEQ ID No. 145, SEQ ID No. 147, SEQ ID No. 148, SEQ ID No. 149, SEQ ID No. 150, SEQ ID 
No. 152, SEQ ID No. 1259, SEQ ID No. 162, SEQ ID No. 166, SEQ ID No. 1 154, SEQ ID No. 167, 

35 SEQ ID No. 1261, SEQ ID No. 1 156, SEQ ID No. 1 157, SEQ ID No. 178, SEQ ID No. 179, SEQ ID 
No. 1158, SEQ ID No. 182, SEQ ID No. 183, SEQ ID No. 184, SEQ ID No. 185, SEQ ID No. 1159, 
SEQ ID No. 186, SEQ ID No. 1160, SEQ ID No. 187, SEQ ID No. 188, SEQ ID No. 189, SEQ ID 
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No. 190, SEQ ID No. 1161, SEQ ID No. 1162, SEQ ID No. 191, SEQ ID No. 192, SEQ ID No. 
194, SEQ ID No. 195, SEQ ID No. 1163, SEQ ID No. 196, SEQ ID No. 201, SEQ ID No. 202, SEQ 
ID No. 209, SEQ ID No. 212, SEQ ID No. 221, SEQ ID No. 224, SEQ ID No. 1 167, SEQ ID No. 226, 
SEQ ID No. 227, SEQ ID No. 228, SEQ ID No. 229, SEQ ID No. 230, SEQ ID No. 231 , SEQ ID No. 
5 232, SEQ ID No. 1 169, SEQ ID No. 1 170, SEQ ID No. 1 17 1 , SEQ ID No. 234, SEQ ID No. 235, SEQ 
ID No. 236, SEQ ID No. 1172, SEQ ID No. 243, SEQ ID No. 251, SEQ ID No. 252, SEQ ID No. 
1 176, SEQ ID No. 253, SEQ ID No. 255, SEQ ID No. 254, SEQ ID No. 256, SEQ ID No. 1 177, SEQ 
ID No. 1178, SEQ ID No. 262, SEQ ID No. 263, SEQ ID No. 1264, SEQ ID No. 278, SEQ ID No. 
279 SEQ ID No. 1 180, SEQ ID No. 280, SEQ ID No. 290, SEQ ID No. 291, SEQ ID No. 292, SEQ 
10 ID No. 296, SEQ ID No. 1181, SEQ ID No. 297, SEQ ID No. 298, SEQ ID No. 300, SEQ ID No. 
1265 SEQ ID No. 322, SEQ ID No. 324, SEQ ID No. 325, SEQ ID No. 370, SEQ ID No. 1186, SEQ 
ID No. 371, SEQ ID No. 372, SEQ ID No. 1187, SEQ ID No. 373, SEQ ID No. 378, SEQ ID No. 
1266, SEQ ID No. 382, SEQ ID No. 383, SEQ ID No. 384, SEQ ID No. 385, SEQ ID No. 386, SEQ 
ID No. 1 188, SEQ ID No. 1 189, SEQ ID No. 391, SEQ ID No. 392, SEQ ID No. 398, SEQ ID No. 
15 400, SEQ ID No. 403, SEQ ID No. 1 191, SEQ ID No. 423, SEQ ID No. 435, SEQ ID No. 445, SEQ 
ID No. 450, SEQ ID No. 1 193, SEQ ID No. 456, SEQ ID No. 460, SEQ ID No. 461, SEQ ID No. 465, 
SEQ ID No. 1 196, SEQ ID No. 471, SEQ ID No. 473, SEQ ID No. 475, SEQ ID No. 481, SEQ ID 
No. 484, SEQ ID No. 487, SEQ ID No. 488, SEQ ID No. 489, SEQ ID No. 490, SEQ ID No. 491, 
SEQ ID No. 492, SEQ ID No. 493, SEQ ID No. 494, SEQ ID No. 495, SEQ ID No. 496, SEQ ID No. 
2 0 497 SEQ ID No. 498, SEQ ID No. 499, SEQ ID No. 502, SEQ ID No. 1267, SEQ ID No. 1268, SEQ 
ID No. 508, SEQ ID No. 510, SEQ ID No. 509, SEQ ED No. 512, SEQ ID No. 515, SEQ ID No. 519, 
SEQ ID No. 1 197, SEQ ID No. 521, SEQ ID No. 1 198, SEQ ID No. 522, SEQ ID No. 524, SEQ ID 
No 528, SEQ ID No. 534, SEQ ID No. 537, SEQ ID No. 1269, SEQ ID No. 1270, SEQ ID No. 548, 
SEQ ID No. 551, SEQ ID No. 557, SEQ ID No. 1201, SEQ ID No. 1203, SEQ ID No. 562, SEQ ID 
25 No. 566, SEQ ID No. 593, SEQ ID No. 595, SEQ ID No. 600, SEQ ID No. 1271, SEQ ID No. 604, 
SEQ ID No. 61 1, SEQ ID No. 612, SEQ ID No. 614, SEQ ID No. 616, SEQ ID No. 625, SEQ ID No. 
627 SEQ ID No. 628, SEQ ID No. 629, SEQ ID No. 631, SEQ ID No. 641, SEQ ID No. 1272, SEQ 
ID No. 648, SEQ ID No. 1212, SEQ ID No. 663, SEQ ID No. 685, SEQ ID No. 707, SEQ ID No. 714, 
SEQ ID No. 715, SEQ ID No. 716, SEQ ID No. 717, SEQ ID No. 722, SEQ ID No. 746, SEQ ID No. 
30 1273, SEQ ID No. 761, SEQ ID No. 764, SEQ ID No. 770, SEQ ID No. 1217, SEQ ID No. 783, SEQ 
ID No. 1274, SEQ ID No. 803, SEQ ID No. 815, SEQ ID No. 1220, SEQ ID No. 835, SEQ ID No. 
1221, SEQ ID No. 844, SEQ ID No. 845, SEQ ID No. 846, SEQ ID No. 847, SEQ ID No. 848, SEQ 
ID No. 849, SEQ ID No. 850, SEQ ID No. 851, SEQ ID No. 1275, SEQ ID No. 852, SEQ ID No. 862, 
SEQ ID No. 1276, SEQ ID No. 1277, SEQ ID No. 873, SEQ ID No. 1223, SEQ ID No. 892, SEQ ID 
35 No. 919, SEQ ID No. 1225, SEQ ID No. 1278, SEQ ID No. 926, SEQ ID No. 1228, SEQ ID No. 
1229, SEQ ID No. 1230, SEQ ID No. 1279, SEQ ID No. 1281, SEQ ID No. 1282, SEQ ID No. 1283. 
SEQ ID No. 948, SEQ ID No. 950, SEQ ID No. 949, SEQ ID No. 951, SEQ ID No. 980, SEQ ID No. 
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982, SEQ ID No. 1233, SEQ ID No. 999, SEQ ID No. 1000, SEQ ID No. 1001, SEQ ID No. 1002, 
SEQ ID No. 1008, SEQ ID No. 1285, SEQ ID No. 1235, SEQ ID No. 1016, SEQ ID No. 1019, SEQ 
ID No. 1027, SEQ ID No. 1036, SEQ ID No. 1241, SEQ ID No. 1048, SEQ ID No. 1049, SEQ ID No. 
1050, SEQ ID No. 1053, SEQ ID No. 1054, SEQ ID No. 1064, SEQ ID No. 1076, SEQ ID No. 1091, 
5 SEQ ID No. 1288, SEQ ID No. 1093, SEQ ID No. 1289, SEQ ID No. 1 101, SEQ ID No. 1 103, SEQ 
ID No. 1245, SEQ ID No. 1246, SEQ ID No. 1247, SEQ ID No. 1290, SEQ ID No. 1291, SEQ ID No. 
1 1 15, SEQ ID No. 1 1 16, SEQ ID No. 1 1 18, SEQ ID No. 1 120, SEQ ID No. 1249, SEQ ID No. 1 121, 
SEQ ID No. 1250, SEQ ID No. 1 126, SEQ ID No. 1251, SEQ ID No. 1 127, SEQ ID No. 1 128, SEQ 
ID No. 1130, SEQ ID No. 1129, SEQ ID No. 1131, SEQ ID No. 1136, SEQ ID No. 1253, SEQ ID No. 
10 6844, SEQ ID No. 6846, SEQ ID No. 6847, SEQ ID No. 6848, and one of their representative 
fragments 

Preferably, the invention relates to a polypeptide according to the invention, 
characterized in that it is a Chlamydia pneumoniae polypeptide or one of its representative fragments 
which is involved in the intermediate metabolism, in particular in the metabolism of sugars and/or of 

15 cofactors, and in that it is chosen from the polypeptides having the following sequences: 

SEQ ID No. 2; SEQ ID No. 55; SEQ ID No. 56; SEQ ID No. 69; SEQ ID No. 75; SEQ ID No. 80; 
SEQ ID No. 100; SEQ ID No. 1 10; SEQ ID No. 114; SEQ ID No. 120; SEQ ID No. 121 ; SEQ ID 
No. 157; SEQ ID No. 160; SEQ ID No. 161; SEQ ID No. 172; SEQ ID No. 180; SEQ ED No. 181; 
SEQ ID No. 198; SEQ ID No. 200; SEQ ID No. 225; SEQ ID No. 248; SEQ ID No. 249; SEQ ID 

20 No. 276; SEQ ID No. 277; SEQ ID No. 318; SEQ ID No. 319; SEQ ID No. 320; SEQ ED No. 323; 
SEQ ED No. 331; SEQ ID No. 347; SEQ ED No. 375; SEQ ID No. 376; SEQ ED No. 381; SEQ ED 
No. 393; SEQ ID No. 394; SEQ ID No. 395; SEQ ID No. 396; SEQ ID No. 409; SEQ ID No. 446; 
SEQ ID No. 447; SEQ ID No. 448; SEQ ID No. 449; SEQ ID No. 513; SEQ ID No. 516; SEQ ID 
No. 571; SEQ ED No. 647; SEQ ED No. 662; SEQ ID No. 697; SEQ ED No. 718; SEQ ED No. 793; 

25 SEQ ID No. 794; SEQ ID No. 808; SEQ ID No. 809; SEQ ID No. 838; SEQ ID No. 839; SEQ ID 
No. 840; SEQ ED No. 853; SEQ ED No. 854; SEQ ID No. 918; SEQ ED No. 923; SEQ ED No. 929; 
SEQ ID No. 931 ; SEQ ID No. 938; SEQ ID No. 939; SEQ ID No. 958; SEQ ID No. 959; SEQ ID 
No. 960; SEQ ED No. 966; SEQ ED No. 995; SEQ ID No. 1021; SEQ ED No. 1040; SEQ ID 
No. 1041; SEQ ID No. 1042; SEQ ID No. 1085; SEQ ID No. 1100; SEQ ID No. 1102; SEQ ID 

30 No. 1117; SEQ ID No. 1118; SEQ ID No. 1119; SEQ ID No. 1120; SEQ ID No. 1135 and one of 
their representative fragments. 

Preferably, the invention relates to a polypeptide according to the invention, 
characterized in that it is a Chlamydia pneumoniae polypeptide or one of its representative fragments 
which is involved in the intermediate metabolism of nucleotides or nucleic acids, and in that it is 

3 5 chosen from the polypeptides having the following sequences: 

SEQ ID No. 77; SEQ ID No. 78; SEQ ID No. 138; SEQ ID No. 189; SEQ ED No. 190; SEQ ED 
No. 233; SEQ ID No. 246; SEQ ID No. 338; SEQ ID No. 412; SEQ ID No. 421 ; SEQ ID No. 438; 
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SEQID No.607; SEQID N„.648; SEQID No.657; SEQID No.740 ; SEQ.D fenk 
SEQ E> No. 967, SEQ ID No. 989; SEQ ID No. 990; SEQ ID No. 992; SEQ ID No. 10. 1; SEQ ID 
No. 1058; SEQ ID No. 1059; SEQ ID No. 1073; SEQ ID No. 1074 and one of .heir 
fragments. 

5 ft.fcn.bly, Ike invention .elaes » a polypeptide according to tke tnvennon, 

chtnacKrized in ft. i. is . Ckton^n pr— « polypeptide « one of ks repressive ftngma* 
which is involved in .he .nerabohsm of nucleic acids, and in .ha, i. is chosen horn ,he polypepttdes 
having the following sequences: 

SEQID No 14- SEQID No. 59; SEQID No. 70; SEQID No. 7,; SEQID No. 97, SEQID 
!. N?„3- SEQID No.,37;SEQID No. 141; SEQID No. 169; SEQ ID N„.285;SEQm No^287; 
SEQ H>' No 288; SEQ ID N„.313;SEQ,D N„.326;SEQID No. 358; SEQ ID 
No 443' SEQID No. 548; SEQID No. 569; SEQ ID No. 601; SEQ ID N„.65.;SEQID Ntftt 
SEQ ID* No 658; SEQ ID No.659.SEQK. No.664;SE Qm No.665;SEQE3 No.6c*;SEQ03 
No 698 SEQ ID No. 704; SEQ ID No. 760; SEQ ID No. 762; SEQ ID No. 763; SEQ ID Na ** 
15 SEQ.D No 787;SEQID No. 788; SEQ H> No.801;SEQID No. 802; SEQ ID No.8,2; SEQID 
» .» N„.870;SEQ,D No.89,;SEQID No.8,8;SEQ,D N.90* 
SEQ ID No 908; SEQ ID No. 916; SEQID No.954;SEQID No. 955; SEQ ID No.96,; S EQ» 
No 983- SEQ ID NO. 996; SEQID No. 10O7; SEQID No. .012; SEQID No. .013; SEQID 
No. ,014; SEQ ID No. ,015; SEQ ID No. 1038; SEQ ID No. 1 .37 and one of thei, represent,.™ 
20 fragment,. ^ ^ ^ ^ ^ ^ 

chancr.riaed in ,h,t i, is a CWomyd/o pn~e po.ypepMe or one of its representative fragments 
which is .nvolvcd in the metabohsm o, and™ acid, „, po.ypep.ide,. and in tha. U i, chosen from ,he 
polypeptides having .he following sequences: 
2 S SEQm NO 99; SEQID No. Ill; SEQID No. 127; SEQ ID No. .34; SEQID No. ,40; SEQID 
No 174' SEQ ID n!>75;SEQ,D No. 176; SEQ ID No.353;SEQID No. 377; SEQ ID No.4M; 
SEQID No. 523; SEQ ID No.539;SEQID No.559;SEQID No.56,;SEQID No. 586; SEQ ID 
NO 598- SEQ ID No. 609: SEQ ID No. 636; SEQID No. 687; SEQ ID No. 700; SEQ ID No. 701; 

SEQID No.,59:SEQ,D No. 790; SEQ ID No.85,;SEQID No.86,;SEQID N.90, SEQID 
30 No 936; SEQ ID No. 952; SEQ ID No. 962; SEQID No. 963: SEQ ID No. 964; SEQ ID No. 9 . 

SEQID NO 99, SEQID No. 1003; SEQID No. ,004; SEQID No. .005; SEQ* No. ; 

SEQID No. 1067: SEQID No. 1110; SEQ ID No. 1111; SEQID No. 1112; SEQID No.1.4; 

SEQID N0.112,;SEQID No. 1122; SEQID No. 1123; SEQID No. 1124; SEQ ID No..l25and 

one of their representative fragments. 

Preferably, the invention relates to a polypeptide according to the invent.on, 
characterized in that it is a Chlamydia pneumoniae polypeptide or one of its representative fragments 
which is involved in the metabolism of polypeptides, and in that it is chosen from the polypeptides 
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having the following sequences: 

SEQID No. 4; SEQID No. 44; SEQID No. 45; SEQ ID No. 48; SEQ ID No. 54; SEQID 
No. 1 12; SEQ ID No. 130; SEQ ID No. 155; SEQ ID No. 163; SEQ ID No. 212; SEQ ID No. 257; 
SEQ ID No. 307; SEQ ID No. 343; SEQ ID No. 405; SEQ ID No. 416; SEQ ID No. 458; SEQ ID 
5 No. 540; SEQ ID No. 541; SEQ ID No. 542; SEQ ID No. 543; SEQ ID No. 544; SEQ ID No. 560; 
SEQ ID No. 594; SEQ ID No. 652; SEQ ID No. 699; SEQ ID No. 723; SEQ ID No. 747; SEQ ID 
No. 817; SEQ ID No. 827; SEQ ID No. 871; SEQ ID No. 909; SEQ ID No. 910; SEQ ID No. 911; 
SEQID No. 912; SEQID No. 1023; SEQID No. 1051; SEQID No. 1052; SEQ ID No. 1081 and 
one of their representative fragments. 
10 Preferably, the invention relates to a polypeptide according to the invention, 

characterized in that it is a Chlamydia pneumoniae polypeptide or one of its representative fragments 
which is involved in the metabolism of fatty acids, and in that it is chosen from the polypeptides 
having the following sequences: 

SEQ ID No. 76; SEQ ID No. 284; SEQ ID No. 308; SEQ ID No. 309; SEQ ID No. 310; SEQ ID 
15 No. 31 1 ; SEQ ID No. 312; SEQ ID No. 425; SEQ ID No. 433; SEQ ID No. 565; SEQ ID No. 688; 
SEQ ID No. 690; SEQ ID No. 691; SEQ ID No. 767; SEQ ID No. 797; SEQ ID No. 894; SEQ ID 
No. 895; SEQID No. 994; SEQID No. 1020; SEQID No. 1030; SEQID No. 1033; SEQID 
No. 1034; SEQ ID No. 1046; SEQ ID No. 1047; SEQ ID No. 1057 and one of their representative 
fragments. 

20 Preferably, the invention relates to a polypeptide according to the invention, 

characterized in that it is a Chlamydia pneumoniae polypeptide or one of its representative fragments 
which is involved in the synthesis of the wall, and in that it is chosen from the polypeptides having the 
following sequences: 

SEQID No. 49; SEQID No. 50; SEQID No. 177; SEQID No. 178; SEQID No. 245; SEQID 

2 5 No. 610; SEQ ID No. 972; SEQ ID No. 974; SEQ ID No. 978; SEQ ID No. 1037 and one of their 

representative fragments. 

Preferably, the invention relates to a polypeptide according to the invention, 
characterized in that it is a Chlamydia pneumoniae polypeptide or one of its representative fragments 
which is involved in the transcription, translation and/or maturation process, and in that it is chosen 

3 0 from the polypeptides having the following sequences: 

SEQID No. 90; SEQ ID No. 92; SEQID No. 131; SEQID No. 151; SEQID No. 199; SEQID 
No. 333; SEQ ID No. 334; SEQ ID No. 336; SEQ ID No. 379; SEQ ID No. 589; SEQ ID No. 590; 
SEQID No. 619; SEQ ID No. 630; SEQ ID No. 649; SEQ ID No. 739; SEQ ID No. 741; SEQID 
No. 806; SEQ ID No. 821; SEQ ID No. 843; SEQ ID No. 968; SEQ ID No. 971; SEQ ID No. 1061 
3 5 and one of their representative fragments. 

Preferably, the invention relates to a polypeptide according to the invention, 
characterized in that it is a Chlamydia pneumoniae ribosomal polypeptide or one of its representative 
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fragments, and in that it is chosen from the polypeptides having the following sequences: 

No. 93; SEQ ID No. 94; SEQID No.95; SEQID No. 136; SEQID No.259; SEQID 
No 332- SEQ ID No. 348; SEQ ID No.583;SEQID No.584; SEQID No.588; SEQID No. 591; 
SEQ Id' No 592; SEQ ID No. 663; SEQ ID No. 666; SEQ ID No. 667; SEQ ID No. 669; SEQ ID 
5 No 670- SEQ ID No. 671; SEQ ID No. 672; SEQ ID No. 673; SEQ ID No. 674; SEQ ID No. 675; 
SEQ ID No. 676; SEQ ID No. 677; SEQ ID No. 678; SEQ ID No. 679; SEQ ID No. 680; SEQ ID 
No 681- SEQ ID No. 683; SEQ ID No. 684; SEQ ID No. 738; SEQ ID No. 781; SEQ ID No. 1008; 
SEQID No 1024- SEQ ID No. 1025; SEQ ID No. 1066 and one of their representative fragments. 

Preferably, the invention also relates to a polypeptide according to the invention, 
10 characterized in that it is a Chlamydia pneumoniae transport polypeptide or one of its representative 
fragments, and in that it is chosen from the polypeptides having the following sequences: 
SEQID No 40; SEQID No. 41; SEQID No. 52; SEQID No. 105; SEQID No. 106; SEQID 
No 107- SEQ ID No. 109; SEQ ID No. 133; SEQID No. 210; SEQ ID No. 211; SEQ ID No. 214; 
SEQ ID' No. 215; SEQ ID No.216; SEQID No.217;SEQID No.218; SEQID No. 219; SEQID 
15 No 220; SEQ ID No. 223; SEQID No. 242; SEQ ID No.260; SEQID No.293;SEQID No. 299; 
SEQID No 366; SEQ ID No. 369; SEQ ID No.575;SEQID No.602; SEQID No. 638; SEQID 
No 639- SEQ ID No. 640; SEQ ID No. 643; SEQ ID No. 653; SEQ ID No. 702; SEQ ID No. 703; 
SEQID No 724; SEQ ID No. 732; SEQ ID No. 855; SEQ ID No. 856; SEQ ID No. 901; SEQ ID 
No 906; SEQID No. 933; SEQ ID No. 942; SEQID No. 1043; SEQID No. 1086; SEQID 
20 No 1105 and one of their representative fragments. 

Preferably, the invention relates to a polypeptide according to the invention, 
characterized in that it is a Chlamydia pneumoniae polypeptide or one of its representative fragments 
which is involved in the virulence process, and in that it is chosen from the polypeptides havmg the 
following sequences : 

25 SEQID No.546; SEQID No.550; SEQID No. 778; SEQID No. 779; SEQ ID No. 886 and one 

of their representative fragments. 

Preferably, the invention relates to a polypeptide according to the invention, 
characterized in that it is a Chlamydia pneumoniae polypeptide or one of its representative fragments 
which is involved in the secretory system and/or which is secreted, and in that it is chosen from the 
30 polypeptides having the following sequences: 

r EQ m No. 751; SEQID No.874;SEQID No.875;SEQID No. 876; SEQ ID No.883;SEQID 
No 884- SEQ ID No. 885 and one of their representative fragments. 

The secreted polypeptides, including the Type III and other, non-Type III secreted 
polypeptides, of the present invention, as well as the corresponding nucleotide sequences, may be 
35 detected by techniques known to persons skilled in the art, such as for example the techniques usmg 
cloning combined with vectors allowing the expression of the said polypeptides msed to export 
markers such as the luc gene for luciferase or the PhoA gene for alkaline phosphatase. 
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Preferably, the invention relates to a polypeptide according to the invention, 
characterized in that it is a polypeptide specific to Chlamydia pneumoniae or one of its representative 
fragments(with a Blast E value of >10" 5 ), and in that it is chosen from the polypeptides having the 
following sequences: 

5 SEQ ID No. 7; SEQ ID No. 8; SEQ ID No. 17; SEQ ID No. 18; SEQ ID No. 19; SEQ ID No. 20; 
SEQID No. 22; SEQ ID No. 23; SEQ ID No. 24; SEQ ID No. 51; SEQ ID No. 60; SEQ ID 
No. 63; SEQID No. 65; SEQID No. 66; SEQID No. 67; SEQID No. 83; SEQID No. 84; 
SEQID No. 86; SEQID No. 87; SEQID No. 125; SEQID No. 143; SEQID No. 144; SEQID 
No. 179; SEQ ID No. 182; SEQ ID No. 184; SEQ ID No. 185; SEQ ID No. 187; SEQ ID No. 221; 

10 SEQ ID No. 252; SEQ ID No. 254;; SEQ ID No. 278; SEQ ID No. 279; SEQ ID No. 387; SEQ ID 
No. 388; SEQ ID No. 397; SEQ ID No. 1048; SEQ ID No. 1049; SEQ ID No. 1050; SEQ ID 
No. 1 128; SEQ ID No. 1 130; SEQ ID No. 1 131 and one of their representative fragments. 

In general, in the present invention, the functional group to which a polypeptide of the 
invention belongs, as well as its corresponding nucleotide sequence, may be determined either by 

15 comparative analogy with sequences already known, or by the use of standard techniques of 
biochemistry, of cytology combined with the techniques of genetic engineering such as 
immunoaffinity, localization by immunolabelling, differential extraction, measurement of enzymatic 
activity, study of the activity inducing or repressing expression or the study of expression in E. coli. 

It is clearly understood, on the one hand, that, in the present invention, the nucleotide 

20 sequences (ORF) and the amino acid sequences (SEQ ID No. 2 to SEQ ID No. 1291 and SEQ ID No. 
6844 to SEQ ID No. 6848) which are listed by functional group, are not exhaustive within the group 
considered. Moreover, it is also clearly understood that, in the present invention, a nucleotide sequence 
(ORF) or an amino acid sequence mentioned within a given functional group may also be part of 
another group taking into account, for example, the interrelationship between the groups listed. 

2 5 Accordingly, and as an example of this interrelationship, an exported and/or secreted polypeptide as 

well as its coding nucleotide sequence may also be involved in the Chlamydia pneumoniae virulence 
process by modifying the defense mechanism of the infected host cell, or a transmembrane 
polypeptide or its coding nucleotide sequence is also part of the polypeptides or coding nucleotide 
sequences of the cellular envelope. 

3 0 The subject of the present invention is also the nucleotide and/or polypeptide sequences 

according to the invention, characterized in that the said sequences are recorded on a medium, called 
recording medium, whose type and nature facilitate the reading, the analysis and the exploitation of 
the said sequences. These media may of course also contain other information extracted from the 
present invention, such as in particular the analogies with already known sequences, such as those 
35 mentioned in Table 1 of the present description, and/or may contain, in addition, information relating 
to the nucleotide and/or polypeptide sequences of other microorganisms so as to facilitate the 
comparative analysis and the exploitation of the results obtained. 
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Among these recording media, computer-readable media, such as magnetic, 
optical, electrical and hybrid media such as, for example, floppy disks, CD-ROMs or recording 
cassettes, are preferred in particular. 

The invention also relates to nucleotide sequences which can be used as primer or probe, 
5 characterized in that the said sequences are chosen from the nucleotide sequences according to the 
invention. 

The invention relates, in addition, to the use of a nucleotide sequence according to the 
invention, as primer or probe, for the detection and/or amplification of nucleic acid sequences. 

The nucleotide sequences according to the invention may thus be used to amplify 
10 nucleotide sequences, in particular by the PCR technique (polymerase chain reaction) (Erlich, 1989; 
Innis et al., 1990; Rolfs et al., 1991, and White et al., 1997). 

These oligodeoxyribonucleotide or oligoribonucleotide primers correspond to 
representative nucleotide fragments, and are advantageously at least 8 nucleotides, preferably at least 
12 nucleotides, 15 nucleotides and still more preferably at least 20 nucleotides long. 
1 5 Other techniques for amplifying the target nucleic acid may be advantageously used as 

alternatives to PCR. 

The nucleotide sequences of the invention, in particular the primers according to the 
invention, may also be used in other methods for amplifying a target nucleic acid, such as: 

- the TAS (Transcription-based Amplification System) technique described by Kwoh et al. in 1989; 
20 - the 3SR (Self-Sustained Sequence Replication) technique described by Guatelli et al. in 1990; 

- the NASBA (Nucleic Acid Sequence Based Amplification) technique described by Kievitis et al. 

in 1991; 

the SDA (Strand Displacement Amplification) technique (Walker et al., 1992); 

- the TMA (Transcription Mediated Amplification) technique. 

2 5 The polynucleotides of the invention may also be used in techniques for amplifying or for 

modifying the nucleic acid serving as probe, such as: 

- the LCR (Ligase Chain Reaction) technique described by Landegren et al. in 1988 and perfected 

by Barany et al. in 1991 , which uses a thermostable ligase; 

- the RCR (Repair Chain Reaction) technique described by Segev in 1992; 

3 0 - the CPR (Cycling Probe Reaction) technique described by Duck et al. in 1990; 

- the Q-beta-re P licase amplification technique described by Miele et al. in 1983 and perfected in 

particular by Chu et al. in 1986, Lizardi et al. in 1988, and then by Burg et al. as well as by 
Stone et al. in 1996. 

The invention also relates to the nucleotide sequences of fragments which can be 
35 obtained by amplification with the aid of at least one primer according to the invention. The present 
invention encompasses both hybridization probes and primers. In general, the complementary probes 
should be of a length sufficient to form a stable hybrid complex with the target sequences . Primers, 
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while complementary to the target sequences need not form stable hybridization complexes 
with the target sequences alone. Rather, primers form stable complexes with the target sequences in 
the presence of polymerase to permit extension of the primer. 

In the case where the target polynucleotide to be detected is possibly an RNA, for 
5 example an mRNA, it will be possible to use, prior to the use of an amplification reaction with the aid 
of at least one primer according to the invention or to the use of a method of detection with the aid of 
at least one probe of the invention, a reverse transcriptase-type enzyme so as to obtain a cDNA from 
the RNA contained in the biological sample. The cDNA obtained will then serve as target for the 
primer(s) or the probe(s) used in the amplification or detection method according to the invention. 
1 0 The detection probe will be chosen so that it hybridizes with the target sequence or the 

amplicon generated from the target sequence. Such a detection probe will advantageously have as 
sequence a sequence of at least 12 nucleotides, in particular of at least 20 nucleotides, and preferably 
at least 100 nucleotides. 

The invention also comprises the nucleotide sequences which can be used as probe or 
15 primer according to the invention, characterized in that they are labelled with a radioactive compound 
or with a nonradioactive compound. 

The nonlabelled nucleotide sequences may be used directly as probes or primers; 
however, the sequences are generally labelled with a radioactive element ( 32 P, 35 S, 3 H, ,25 I) or with a 
nonradioactive molecule (biotin, acetylaminofluorene, digoxigenin, 5-bromo-deoxyuridine, 

2 0 fluorescein), so as to obtain probes which can be used in numerous applications. 

Examples of nonradioactive labelling of nucleotide sequences are described, for example, 
in French patent No. 78,10975 or by Urdea et al. or by Sanchez-Pescador et al. in 1988. 

In the latter case, one of the labelling methods described in patents FR-2 422 956 and 
FR-2 518 755 may also be used. 
25 The invention also relates to the nucleotide sequences of fragments which can be 

obtained by hybridization with the aid of at least one probe according to the invention. 

The hybridization technique may be performed in various ways (Matthews et al., 1988). 
The most common method consists in immobilizing the nucleic acid extracted from 
Chlamydia pneumoniae cells on a support (such as nitrocellulose, nylon, polystyrene) and in 

3 0 incubating, under well-defined conditions, the target nucleic acid immobilized with the probe. After 

hybridization, the excess probe is removed and the hybrid molecules formed are detected by the 
appropriate method (measurement of the radioactivity, of the fluorescence or of the enzymatic activity 
linked to the probe). 

The invention also comprises the nucleotide sequences according to the invention, 
3 5 characterized in that they are covalently or noncovalently immobilized on a support. 

According to another advantageous embodiment of the nucleic sequences according to 
the invention, the latter may be used immobilized on a support and may thus serve to capture, through 
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specific hybridization, the target nucleic acid obtained from the biological sample to be tested. 

If necessary, the solid support is separated from the sample and the hybridization complex formed 

between the so-called capture probe and the target nucleic acid is then detected by means of a second 

probe, called detection probe, labelled with an easily detectable element. 
5 The nucleotide sequences according to the invention may also be used in new analytical 

systems, DNA chips, which allow sequencing, the study of mutations and of the expression of genes, 
and which are currently of interest given their very small size and their high capacity in terms of 
number of analyses. 

The principle of the operation of these chips is based on molecular probes, most often 
10 oligonucleotides, which are attached onto a miniaturized surface, generally of the order of a few 
square centimetres. During an analysis, a sample containing fragments of a target nucleic acid to be 
analysed for example DNA or RNA labelled, for example, after amplification, is deposited onto the 
DNA chip in which the support has been coated beforehand with probes. Bringing the labelled target 
sequences into contact with the probes leads to the formation, through hybridization, of a duplex 
15 according to the rule of pairing defined by J.D. Watson and F. Crick. After a washing step, analysts of 
the surface of the chip allows the effective hybridizations to be located by means of the signals enutted 
by the labels tagging the target. A hybridization fingerprint results from this analysis wh.ch, by 
appropriate computer processing, will make it possible to determine information such as the presence 
of specific fragments in the sample, the determination of sequences and the presence of mutahons. 
20 The chip consists of a multitude of molecular probes, precisely organized or arrayed on a 

solid support whose surface is miniaturized. It is at the centre of a system where other elements 
paging system, microcomputer) allow the acquisition and interpretation of a hybruhzatton 

fingerprint. . 

The hybridization supports are provided in the form of flat or porous surfaces (p.erced 

25 with wells) composed of various material, The choice of a support is determined by ns 
physicochemical properties, or more precisely, by the relationship between the latter and the 
conditions under which the support will be placed during the synthesis or the attachment of the probes 
or during the use of the chip. It is therefore necessary, before considering the use of a particular 
support (R.S. Matson et al., 1994), to consider characteristics such as its stability to pH, tts phys.cal 
3 0 strength, its reactivity and its chemical stability as well as its capacity to nonspecifically bind nucle,c 
acid, Materials such as glass, silicon and polymers are commonly used. Their surface is, in a first 
step called "functionalization", made reactive towards the groups which it is desired to attach thereon. 
After the functionalization, so-called spacer molecules are grafted onto the activated surface. Used as 
intermediates between the surface and the probe, these molecules of variable size render unimportant 
35 the surface properties of the supports, which often prove to be problematic for the synthesis or the 
attachment of the probes and for the hybridization. 

Among the hybridization supports, there may be mentioned glass which is used, for 
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example, in the method of in situ synthesis, of oligonucleotides by photochemical addressing 
developed by the company Affymetrix (E.L. Sheldon, 1993), the glass surface being activated by 
silane. Genosensor Consortium (P. Merel, 1994) also uses glass slides carrying wells 3 mm apart, this 
support being activated with epoxysilane. 
5 Polymers or silicon may also be mentioned among these hybridization supports. For 

example, the Andrein Mirzabekov team has developed a chip consisting of poly aery lamide squares 
polymerized on a silanized glass surface (G. Yershov et aL, 1996). Several teams use silicon, in 
particular the IFOS laboratory of Ecole Centrale of Lyon which uses a silicon semiconductor substrate 
which is p-doped by introducing it into its crystalline structure atoms whose valency is different from 

10 that of silicon. Various types of metals, in particular gold and platinum, may also be used as support 
(Genosensor Consortium (K. Beattie et al,, 1993)). 

The probes according to the invention may be synthesized directly in situ on the supports 
of the DNA chips. This in situ synthesis may be carried out by photochemical addressing (developed 
by the company Affymax (Amsterdam, Holland) and exploited industrially by its subsidiary 

15 Affymetrix (United States)) or based on the VLSIPS (very large scale immobilized polymer synthesis) 
technology (S.P.A. Fodor etal., 1991) which is based on a method of photochemically directed 
combinatory synthesis and the principle of which combines solid-phase chemistry, the use of 
photolabile protecting groups and photolithography. 

The probes according to the invention may be attached to the DNA chips in various ways 

20 such as electrochemical addressing, automated addressing or the use of probe printers 
(T. Livache et al., 1994; G. Yershov et al., 1996; J. Derisi et al., 1996, and S. Borman, 1996). 

The revealing of the hybridization between the probes of the invention, deposited or 
synthesized in situ on the supports of the DNA chips, and the sample to be analysed, may be 
determined, for example, by measurement of fluorescent signals, by radioactive counting or by 

2 5 electronic detection. 

The use of fluorescent molecules such as fluorescein constitutes the most common 
method of labelling the samples. It allows direct or indirect revealing of the hybridization and allows 
the use of various fluorochromes. 

Affymetrix currently provides an apparatus or a scanner designed to read its Gene Chip™ 

3 0 chips. It makes it possible to detect the hybridizations by scanning the surface of the chip in confocal 

microscopy (R.J. Lipshutz etal., 1995). Other methods of detecting fluorescent signals have been 
tested: coupling of an epifluorescence microscope and a CCD camera (G. Yershov et al., 1996), the 
use of an optical fibre collecting system (E.L. Sheldon, 1993). A conventional method consists in 
carrying out an end labelling, with phosphorus 32, of the target sequences, by means of an appropriate 
35 apparatus, the Phosphorimager (marketed by Molecular Dynamics). The electronic detection is based 
on the principle that the hybridization of two nucleic acid molecules is accompanied by physical 
phenomena which can be quantified under certain conditions (system developed by Ecole Centrale of 
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Lyon and called GEN-FET (GEN field effect transistor)). Genosensor Consortium and the 
company Beckman Instruments who are developing an electronic chip or Permittivity Chips™ may 

also be mentioned (K. Beattie et al., 1993). 

The nucleotide sequences according to the invention may thus be used in DNA ch,ps to 
5 carry out the analysis of mutations. This analysis is based on the production of chips capable of 
analysing each base of a nucleotide sequence according to the invention. 

The nucleotide sequences according to the invention may also be used in DNA chips to 
carry out the analysis of the expression of the Chlamydia pneumoniae genes. This analysis of the 
expression of Chlamydia pneumoniae genes is based on the use of chips where probes of the 
10 invention, chosen for their specificity to characterize a given gene, are present (DJ. Lockhart et a. 
1996- D D Shoemaker et al., 1996). For the methods of analysis of gene expression using the DNA 
chips reference may, for example, be made to the methods described by D.J. Lockhart et al. (1996) 
and Sosnowsky et al. (1997) for the synthesis of probes in situ or for the addressing and the attachment 
of previously synthesized probes. The target sequences to be analysed are labelled and in general 
15 fragmented into sequences of about 50 to 100 nucleotides before being hybridized onto the chip. After 
washing as described, for example, by DJ. Lockhart et al. (1996) and application of different electric 
fields (Sosnowsky et al., 1997), the labelled compounds are detected and quantified, the hybridizations 
being carried out at least in duplicate. Comparative analyses of the signal intensities obtained with 
respect to the same probe for different samples and/or for different probes with the same sample. 
2 0 determine the differential expression of RN A or of DNA derived from the sample. 

The nucleotide sequences according to the invention may, in addition, be used m DNA 
chips where other nucleotide probes specific for other microorganisms are also present, and may allow 
,he carrying out of a serial test allowing rapid identification of the presence of a microorganism in a 
sample. 

25 Accordingly, the subject of the invention is also the nucleotide sequences according to 

the invention, characterized in that they are immobilized on a support of a DNA chip. 

The DNA chips, characterized in that they contain at least one nucleotide sequence 
according to the invention, immobilized on the support of the said chip, also form part of the 
invention. 

30 The said chips will preferably contain several probes or nucleotide sequences of me 

invention of different length and/or corresponding to different genes so as to identify, with greater 
certainty, the specificity of the target sequences or the desired mutation in the sample to be analysed. 

Accordingly, the analyses carried out by means of primers and/or probes according to the 
invention, immobilized on supports such as DNA chips, will make it possible, for example, to identify, 
35 in samples, mutations linked to variations such as intraspecies variations. These variations may be 
correlated or associated with pathologies specific to the variant identified and will make it possible to 
select the appropriate treatment. 
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The invention thus comprises a DNA chip according to the invention, 
characterized in that it contains, in addition, at least one nucleotide sequence of a microorganism 
different from Chlamydia pneumoniae, immobilized on the support of the said chip; preferably, the 
different microorganism will be chosen from an associated microorganism, a bacterium of the 
5 Chlamydia family, and a variant of the species Chlamydia pneumoniae. 

Another subject of the present invention is a vector for the cloning and/or the expression 
of a sequence, characterized in that it contains a nucleotide sequence according to the invention. 
Among the said vectors according to the invention, the vectors containing a nucleotide sequence 
encoding a polypeptide of the cellular, preferably outer, envelope of Chlamydia pneumoniae or one of 

10 its representative fragments, are preferred. In a specific embodiment, the vectors contain a nucleotide 
sequence encoding a Chlamydia pneumoniae secreted polypeptide or one of its representative 
fragments or encoding a transport polypeptide, a surface exposed polypeptide, a lipoprotein or one of 
its representative fragments, a polypeptide involved in lipopolysaccharide (LPS) biosynthesis, a Type 
III and non-Type III secreted polypeptide, a polypeptide containing RGD attachment sites, a cell wall 

15 anchored surface polypeptide, a polypeptide not found in Chlamydia trachomatis, a ribosomal 
polypeptide or a polypeptide involved in secretion, transcription, translation, maturation of proteins, a 
polypeptide involved in the synthesis of the wall, a polypeptide involved in the virulence, a 
polypeptide involved in the intermediate metabolism, in particular in the metabolism of sugars and/or 
of cofactors, a polypeptide involved in the metabolism of nucleotides, of amino acids, of nucleic acids 

20 or of fatty acids of Chlamydia pneumoniae or one of their representative fragments, or a polypeptide 
specific to Chlamydia pneumoniae. 

According to the invention, the vectors comprise the elements necessary to allow the 
expression and/or the secretion of the said nucleotide sequences in a given host cell, and form part of 
the invention. The vector should, in this case, comprise a promoter, signals for initiation and for 

25 termination of translation, as well as appropriate regions for regulation of transcription. It should be 
capable of being stably maintained in the host cell and may optionally possess particular signals 
specifying the secretion of the translated protein. These different elements are chosen according to the 
host cell used. To this effect, the nucleotide sequences according to the invention may be inserted into 
autonomously-replicating vectors within the chosen host, or integrative vectors in the chosen host. 

30 Any of the standard methods known to those skilled in the art for the insertion of DNA 

fragments into a vector may be used to construct expression vectors containing a chimeric gene 
consisting of appropriate transcriptional/translational control signals and the protein coding sequences. 
These methods may include in vitro recombinant DNA and synthetic techniques and in vivo 
recombinants (genetic recombination). 

35 Expression of a polypeptide, peptide or derivative, or analogs thereof encoded by a 

polynucleotide sequence in SEQ ID No. 1 or ORFs contained within SEQ ID No. 1 may be regulated 
by a second nucleic acid sequence so that the protein or peptide is expressed in a host transformed 
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with the recombinant DNA molecule. For example, expression of a protein or peptide may 
be controlled by any promoter/enhancer element known in the art. Promoters which may be used to 
control expression include, but are not limited to, the CMV promoter, the SV40 early promoter region 
(Bernoist and Chambon, 1981, Nature 290:304-310), the promoter contained in the 3' long terminal 
5 repeat of Rous sarcoma virus (Yamamoto, et aL, 1980, Cell 22:787-797), the herpes thymidine kinase 
promoter (Wagner et aL, 1981, Proc. Natl. Acad. Sci. U.S.A. 78:1441-1445), the regulatory sequences 
of the metallothionein gene (Brinster * aL, mi, Nature 296:39-42); prokaryotic expression vectors 
such as the 3-lactamase promoter (Villa-Kamaroff, et aL, 1978, Proc. Natl. Acad. Sci. U.S.A. 
75-3727-3731), or the too promoter (DeBoer, et aL, 1983, Proc. Natl. Acad. Sci. U.S.A. §0:21-25); see 
10 also "Useful proteins from recombinant bacteria" in Scientific American, 1980, 242:74-94; plant 
expression vectors comprising the nopaline synthetase promoter region (Herrera-Estrella et aL, 1983, 
Nature 303:209-213) or the cauliflower mosaic virus 35S RNA promoter (Gardner, el aL, 1981, Nucl. 
Acids Res. 9:2871), and the promoter of the photosynthetic enzyme ribulose biphosphate carboxylase 
(Herrera-Estrella et aL, 1984, Nature 310:1 15-120); promoter elements from yeast or other fungi such 
15 as the Gal 4 promoter, the ADC (alcohol dehydrogenase) promoter, PGK (phosphoglycerol kinase) 
promoter, alkaline phosphatase promoter, and the following animal transcriptional control regions, 
which exhibit tissue specificity and have been utilized in transgenic animals: elastase I gene control 
region which is active in pancreatic acinar cells (Swift et aL, 1984, Cell 38:639-646; Ornitz et aL, 
1986 Cold Spring Harbor Symp. Quant. Biol. 50:399-409; MacDonald, 1987, Hepatology 7:425-515); 
20 insulin gene control region which is active in pancreatic beta cells (Hanahan, 1985, Nature 215:115- 
122) immunoglobulin gene control region which is active in lymphoid cells (Grosschedl et aL, 1984, 
Cell 38 647-658; Adames et aL, 1985, Nature 31&533-538; Alexander et aL, 1987, Mol. Cell. Biol. 
7 1436-1444), mouse mammary tumor virus control region which is active in testicular, breast, 
lymphoid and mast cells (Leder * aL, 1986, Cell 45:485-195), albumin gene control region which is 
25 active in liver (Pinkert et aL, 1987, Genes and Devel. 1:268-276), alpha-fetoprotein gene control 
region which is active in liver (Krumlauf et aL, 1985, Mol. Cell. Biol. 5:1639-1648; Hammer et aL, 
1987, Science 235:53-58; alpha 1-antitrypsin gene control region which is active in the liver (Kelsey et 
al 1987 Genes and Devel. 1:161-171), beta-globin gene control region which is active in myeloid 
ceils (Mogram et aL, 1985, Nature 315:338-340; Kollias et aL, 1986, Cell 46:89-94; myelin basic 
30 protein gene control region which is active in oligodendrocyte cells in the brain (Readhead et aL, 
1987, Cell 48:703-712); myosin light chain-2 gene control region which is active in skeletal muscle 
(Sani, 1985, Nature 314:283-286), and gonadotropic releasing hormone gene control region which is 
active in the hypothalamus (Mason et aL, 1986, Science 234:1372-1378). 

The vectors according to the invention are, for example, vectors of plasmid or viral 
35 origin In a specific embodiment, a vector is used that comprises a promoter operably linked to a 
protein or peptide-encoding a nucleic acid sequence in SEQ ID No. 1, or ORFs contained within SEQ 
ID No. 1, one or more origins of replication, and, optionally, one or more selectable markers (e.g., an 
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antibiotic resistance gene). Expression vectors comprise regulatory sequences that control gene 
expression, including gene expression in a desired host cell. Preferred vectors for the expression of 
the polypeptides of the invention include the pET-type plasmid vectors (Promega) or pBAD plasmid 
vectors (Invitrogen). Furthermore, the vectors according to the invention are useful for transforming 
5 host cells so as to clone or express the nucleotide sequences of the invention. 

Expression can also be achieved using targeted homologous recombination to activate 
Chlamydia pneumoniae genes present in the cloned genomic DNA. A heterologous regulatory 
element may be inserted into a stable cell line or cloned microorganism, such that it is operatively 
linked with an endogenous Chlamydia pneumoniae gene present in the cloned genome, using 

10 techniques, such as targeted homologous recombination, which are well known to those of skill in the 
art {See, e.g., Chappel, U.S. Patent No. 4,215,051 and Skoultchi, WO 91/06667 each of which is 
incorporated herein in its entirety). 

Expression vector/host cell systems containing inserts of polynucleotide sequences in 
SEQ ID No. 1 or ORFs within SEQ ID No. 1, which encode polypeptides, peptides or derivatives, or 

15 analogs thereof, can be identified by three general approaches: (a) nucleic acid hybridization, (b) 
presence or absence of "marker" gene functions, and (c) expression of inserted sequences. In the first 
approach, the presence of a polynucleotide sequence inserted in an expression vector can be detected 
by nucleic acid hybridization using probes comprising sequences that are homologous to an inserted 
polynucleotide sequence. In the second approach, the recombinant vector/host system can be 

20 identified and selected based upon the presence or absence of certain "marker" gene functions (e.g., 
thymidine kinase activity, resistance to antibiotics, transformation phenotype, occlusion body 
formation in baculovirus, etc.) caused by the insertion of a polynucleotide sequence in the vector. For 
example, if the polynucleotide sequence in SEQ ID No. 1 or ORFs within SEQ ID No. 1 is inserted 
within the marker gene sequence of the vector, recombinants containing the insert can be identified by 

2 5 the absence of the marker gene function. In the third approach, recombinant expression vectors can be 
identified by assaying the product of the polynucleotide sequence expressed by the recombinant. Such 
assays can be based, for example, on the physical or functional properties of the expressed polypeptide 
in in vitro assay systems, e.g., binding with antibody, promotion of cell proliferation. 

Once a particular recombinant DNA molecule is identified and isolated, several methods 

30 known in the art may be used to propagate it. The clones identified may be introduced into an 
appropriate host cell by standard methods, such as for example lipofection, electroporation, and heat 
shock. Once a suitable host system and growth conditions are established, recombinant expression 
vectors can be propagated and prepared in quantity. 

The invention also encompasses the host cells transformed by a vector according to the 

35 invention. These cells may be obtained by introducing into host cells a nucleotide sequence inserted 
into a vector as defined above, and then culturing the said cells under conditions allowing the 
replication and/or the expression of the transfected nucleotide sequence. 
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The host cell may be chosen from eukaryotic or prokaryotic systems, such as for 
example bacterial cells (Olins and Lee, 1993), but also yeast cells (Buckholz, 1993), as well as animal 
cells in particular cultures of mammalian cells (Edwards and Aruffo, 1993), and in particular Chmese 
hamster ovary (CHO) cells, but also insect cells in which methods using baculoviruses for example 

5 may be used (Luckow, 1993). 

Furthermore, a host cell strain may be chosen which modulates the expressron of the 
inserted sequences, or modifies and processes the gene product in the specific fashion des^d. 
Expression from certain promoters can be elevated in the presence of certain inducers; thus 
expression of the genetically engineered polypeptide may be controlled. Furthermore, Afferent hos 
10 ceL have characteristic and specific mechanisms for the translationa, and post-translaUona, 
processing and modification (e.g., glycosylate, phosphorylation) of proteins. Appropriate cell hues 
or host systems can be chosen to ensure the desired modification and processing of the foreign protem 
expressed For example, expression in a bacteria, system can be used to produce an unglycosylated 
col protein product. Expression in yeast will produce a glycosylated product. Expression m 
! 5 mammalian cells can be used to ensure "native" glycosylate of a heterologous protein. Furmermore, 
different vector/host expression systems may effect processing reactions to different extents. 

A preferred host cell for the expression of the proteins of the invention cons.sts of 
prokaryotic cells, such as Gram" bacteria. A further preferred host cell according to the invenuon 
bacterium belonging to the Chlamydia family, more preferably belonging to the species CHla^a 
2 0 pneumoniae or chosen from a microorganism associated with the species Chlamydia pneumoniae. 

In other specific embodiments, the polypeptides, peptides or derivatives, or analogs 
thereof may be expressed as a fusion, or chimeric protein product (comprising the protein, fragment, 
analog, or derivative Joined via a peptide bond to a heterologous protein sequence (of a Afferent 
protein)). Such a chimeric product can be made by ligating the appropriate nucleic acid sequences 
25 encoding the desired amino acid sequences to each other by methods known in the art, in the proper 
coding frame, and expressing the chimeric product by methods commonly known ,n the art. 
Alternatively, such a chimeric product may be made by protein synthetic techniques, ,g., by use of a 
peptide synthesizer. 

Genomic sequences can be cloned and expressed as translation^ gene products {,e., 
30 peptides, polypeptides, and proteins) or transcriptional gene products (/.,, antisense and ribozymes). 

The invention further relates to the intracellular production of an antisense nuclei add 
sequence of SEQ ID No. 1 by transcription from an exogenous sequence. For example, a vector can 
be introduced in vivo such that it is taken up by a cell, within which cell the vector or a portion thereof 
is transcribed, producing an antisense nucleic acid (RNA) of the invention. Such a vector would 
3 5 contain a sequence encoding an antisense nucleic acid. Such a vector can remain episomal or become 
chromosomally integrated, as long as it can be transcribed to produce the desired antisense RNA. 
Such vectors can be constructed by recombinant DNA technology methods standard ,n the art. 
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Vectors can be plasmid, viral, or others known in the art, used for replication and expression in 
mammalian cells. Expression of the sequence encoding the an antisense RNA can be by any promoter 
known in the art to act in mammalian, preferably human, cells. Such promoters can be inducible or 
constitutive. Such promoters include but are not limited to: the CMV promoter, the SV40 early 
5 promoter region (Bernoist and Chambon, 1981, Nature 290:304-310). the promoter contained in the 
3N long terminal repeat of Rous sarcoma virus (Yamamoto et aL, 1980, Cell 22:787-797), the herpes 
thymidine kinase promoter (Wagner et ai, 1981, Proc. Natl. Acad. Sci. U.S.A. 78:1441-1445), the 
regulatory sequences of the metallothionein gene (Brinster et ai, 1982, Nature 296 :39-42). etc. 

In a specific embodiment, the antisense oligonucleotide comprises catalytic RNA, or a 
10 ribozyme (see, e.g., PCT International Publication WO 90/11364, published October 4, 1990; Sarver 
et ai, 1990, Science 247:1222-1225). In another embodiment, the oligonucleotide is a 2N-0- 
methylribonucleotide (Inoue et ai, 1987, Nucl. Acids Res. 15:6131-6148), or a chimeric RNA-DNA 
analog (Inoue et ai, 1987, FEBS Lett. 215:327-330). 

In another embodiment, the antisense nucleic acids of the invention comprise a sequence 
15 complementary to at least a portion of an RNA transcript of a polynucleotide sequence in SEQ ID No. 
1. However, absolute complementarity, although preferred, is not required. A sequence 
"complementary to at least a portion of an RNA," as referred to herein, means a sequence having 
sufficient complementarity to be able to hybridize with the RNA, forming a stable duplex; in the case 
of double-stranded antisense nucleic acid sequence, a single strand of the duplex DNA may thus be 
20 tested, or triplex formation may be assayed. The ability to hybridize will depend on both the degree of 
complementarity and the length of the antisense nucleic acid. Generally, the longer the hybridizing 
nucleic acid, the more base mismatches with an RNA transcribed from SEQ ID No. 1 may contain and 
still form a stable duplex (or triplex, as the case may be). One skilled in the art can ascertain a 
tolerable degree of mismatch by use of standard procedures to determine the melting point of the 
2 5 hybridized complex. 

The invention also relates to the animals, except humans, comprising one of the above- 
described transformed cells according to the invention. 

The production of transgenic animals according to the invention overexpressing one or 
more of the Chlamydia pneumoniae genes will be preferably carried out on rats, mice or rabbits 
30 according to methods well known to persons skilled in the art such as viral or nonviral transfections. 
The transgenic animals overexpressing one or more of the said genes may be obtained by transfection 
of multiple copies of the said genes under the control of a powerful promoter of a ubiquitous nature, or 
which is selective for one type of tissue. The transgenic animals may also be obtained by homologous 
recombination on embryonic stem cells, transfer of these stem cells to embryos, selection of the 
35 chimeras affected at the level of the reproductive lines, and growth of the said chimeras. 

The transformed cells as well as the transgenic animals according to the invention can be 
used in methods of preparing the recombinant polypeptide. 
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It is now possible to produce recombinant polypeptides in a relatively large 
quantity by genetic engineering using the cells transformed with expression vectors according to the 
invention or using transgenic animals according to the invention. 

The methods of preparing a polypeptide of the invention in recombinant form, 
5 characterized in that they use a vector and/or a cell transformed with a vector according to the 
invention and/or a transgenic animal comprising one of the said transformed cells accordmg to the 
invention, are themselves included in the present invention. 

Among the said methods of preparing a polypeptide of the invention in recombinant 
form the methods of preparation using a vector, and/or a ce.l transformed with the said vector and/or a 
10 transgenic animal comprising one of the said transformed cells, containing a nucleotide sequence 
encoding a polypeptide of the cellular envelope of Chlamydia pneumoniae or one of its representee 
fragments, more preferably encoding a polypeptide of the outer cellular envelope of Chlamydta 
pneumoniae or one of its fragment, are preferred. 

Among the said methods of preparing a polypeptide of the invention in recombinant 
1 5 form the methods of preparation using a vector, and/or a cell transformed with the said vector and/or a 
transgenic animal comprising one of the said transformed cells, containing a nucleotide sequence 
encoding a Chlamydia pneumoniae secreted polypeptide or one of its representative fragments or 
encoding a transport polypeptide, a surface exposed polypeptide, a lipoprotein or one of .ts 
representative fragments, a polypeptide involved in lipopolysaccharide biosynthesis, a Type ffl or 
20 other secreted polypeptide, a polypeptide containing ROD attachment sites, a cell wall anchored 
surface polypeptide, a polypeptide not found in Chlamydia trachomatis, a ribosomal polypepttde or a 
polypeptide involved in secretion, transcription, translation, maturation of proteins, a polypeptide 
involved in the synthesis of the wall, a polypeptide involved in the virulence, a polypeptide mvolved 
in the intermediate metabolism, in particular in the metabolism of sugars and/or of cofactors, a 
25 polypeptide involved in the metabolism of nucleotides, of amino acids, of nucleic acids or of fatty 
acids of Chlamydia pneumoniae or one of their representative fragments, or a polypeptide specdic to 

Chlamydia pneumoniae, are also preferred. 

The recombinant polypeptides obtained as indicated above may be provided either m 
glycosylated or non-glycosylated form and may or may not have the natural tertiary structure. 
30 A preferred variant consists in producing a recombinant polypeptide fused to a "earner 

protein (chimeric protein). The advantage of this system is that it allows a stabilization and a 
reduction in proteolysis of the recombinant product, an increase in solubility during renaturafon 
in vitro and/or a simplification of purification when the fusion partner has affinity for a specific hgand. 

More particularly, the invention relates to a method of preparing a polypeptide of the 

3 5 invention comprising the following steps: 

a) culture of the transformed cells under conditions allowing the expression of a recombinant 
polypeptide having a nucleic acid sequence according to the invention; 
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b) where appropriate, recovery of the said recombinant polypeptide. 

When the method of preparing a polypeptide of the invention uses a transgenic animal 
according to the invention, the recombinant polypeptide is then extracted from the said animal. 

The subject of the invention is also a polypeptide capable of being obtained by a method 
5 of the invention as described above. 

The invention also comprises a method of preparing a synthetic polypeptide, 
characterized in that it uses an amino acid sequence of polypeptides according to the invention. 

The invention also relates to a synthetic polypeptide obtained by a method according to 
the invention. 

10 Polypeptides according to the invention may also be prepared by conventional techniques 

in the field of peptide synthesis under conditions suitable to produce the polypeptides encoded by the 
polynucleotide of the invention. This synthesis may be carried out in and recovered from a 
homogeneous solution or on a solid phase. 

For example, the synthesis technique in a homogeneous solution described by 
1 5 Houbenweyl in 1 974 may be used. 

This method of synthesis consists in successively condensing, in pairs, the successive 
amino acids in the required order, or in condensing amino acids and fragments previously formed and 
already containing several amino acids in the appropriate order, or alternatively several fragments thus 
previously prepared, it being understood that care will have been taken to protect beforehand all the 
20 reactive functional groups carried by these amino acids or fragments, with the exception of the amine 
functional groups of one and the carboxyl functional groups of the other or vice versa, which should 
normally take part in the formation of the peptide bonds, in particular after activation of the carboxyl 
functional group, according to methods well known in peptide synthesis. 

According to another preferred technique of the invention, the one described by 
2 5 Merrifield is used. 

To manufacture a peptide chain according to the Merrifield method, a highly porous 
polymer resin is used, onto which the first C-terminal amino acid of the chain is attached. This amino 
acid is attached onto a resin via its carboxyl group and its amine functional group is protected. The 
amino acids which will constitute the peptide chain are thus attached, one after another, onto the amine 
30 group, each time deprotected beforehand, of the portion of the peptide chain already formed, and 
which is attached to the resin. When the entire peptide chain desired is formed, the protecting groups 
are removed from the various amino acids constituting the peptide chain and the peptide is detached 
from the resin with the aid of an acid. 

The invention relates, in addition, to hybrid (fusion) polypeptides having at least one 
35 polypeptide or one of its representative fragments according to the invention, and a sequence of a 
polypeptide capable of eliciting an immune response in humans or animals. 

Advantageously, the antigenic determinant is such that it is capable of eliciting a humoral 
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and/or cellular response. An antigenic determinant may be identified by screening 
expression libraries of the Chlamydia pneumoniae genome with antibodies contained in the serum of 
patients infected with a bacterium belonging to the species Chlamydia pneumoniae. An antigenic 
determinant may comprise a polypeptide or one of its representative fragments according to the 
5 invention, in glycosylated form, used in order to obtain immunogenic compositions capable of 
inducing the synthesis of antibodies directed against multiple epitopes. The said polypeptides or their 
glycosylated fragments also form part of the invention. 

These hybrid molecules may consist, in part, of a carrier molecule for polypeptides or for 
their representative fragments according to the invention, combined with a portion which may be 
10 immunogenic, in particular an epitope of the diphtheria toxin, the tetanus toxin, a hepatitis B virus 
surface antigen (patent FR 7921811), the poliomyelitis vims VP1 antigen or any other v.ral or 

bacterial toxin or antigen. 

The methods of synthesizing the hybrid molecules include the methods used in genetic 
engineering to construct hybrid nucleotide sequences encoding the desired polypeptide sequences. 
15 Reference may be advantageously made, for example, to the technique for producing genes encoding 
fusion proteins described by Minton in 1984. 

The said hybrid nucleotide sequences encoding a hybrid polypeptide as well as the hybnd 
polypeptides according to the invention, characterized in that they are recombinant polypeptides 
obtained by the expression of the said hybrid nucleotide sequences, also form part of the invention. 
2 o The invention also comprises the vectors characterized in that they contain one of the sa,d 

hybrid nucleotide sequences. The host cells transformed by the said vectors, the transgenic animals 
comprising one of the said transformed cells as well as the methods of preparing recombinant 
polypeptides using the said vectors, the said transformed cells and/or the said transgenic animals of 

course also form part of the invention. 
25 The polypeptides according to the invention, the antibodies according to the invention 

described below and the nucleotide sequences according to the invention may advantageously be used 
in in vitro and/or in vivo methods for the detection and/or the identification of bacteria belonging to 
the species Chlamydia pneumoniae, in a biological sample (biological tissue or fluid) which ,s likely 
to contain them. These methods, depending on the specificity of the polypeptides, of the antibodies 
30 and of the nucleotide sequences according to the invention which will be used, may in particular detect 
and/or identify the bacteria, variants belonging to the species Chlamydia pneumoniae as well as the 
associated microorganisms capable of being detected by the polypeptides, the antibodies and the 
nucleotide sequences according to the invention which will be chosen. It may, for example, be 
advantageous to choose a polypeptide, an antibody or a nucleotide sequence according to the 
35 invention, which is capable of detecting any bacterium of the Chlamydia family by choosing a 
polypeptide, an antibody and/or a nucleotide sequence according to the invention which is specific to 
the family or, on the contrary, it will be most particularly advantageous to target a variant of the 
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species Chlamydia pneumoniae, which is responsible, for example, for the induction or the 
worsening of pathologies specific to the targeted variant, by choosing a polypeptide, an antibody 
and/or a nucleotide sequence according to the invention which is specific to the said variant. 

The polypeptides according to the invention may advantageously be used in a method for 
5 the detection and/or the identification of bacteria belonging to the species Chlamydia pneumoniae or 
to an associated microorganism, in a biological sample (biological tissue or fluid) which is likely to 
contain them, characterized in that it comprises the following steps: 

a) bringing this biological sample into contact with a polypeptide or one of its representative 
fragments according to the invention (under conditions allowing an immunological reaction between 

10 the said polypeptide and the antibodies which may be present in the biological sample); 

b) detecting the antigen-antibody complexes which may be formed. 

Preferably, the biological sample consists of a fluid, for example a human or animal 
serum, blood or biopsies. 

Any conventional procedure may be used to carry out such a detection of the antigen- 
1 5 antibody complexes which may be formed. 

By way of example, a preferred method uses immunoenzymatic procedures based on the 
ELISA technique, immunofluorescence procedures or radioimmunological procedures (RIA), and the 
like. 

Accordingly, the invention also relates to the polypeptides according to the invention, 
20 labelled with the aid of a suitable label such as a label of the enzymatic, fluorescent or radioactive 
type. 

Such methods comprise, for example, the following steps: 
- deposition of defined quantities of a polypeptide composition according to the invention into the 
wells of a microti tre plate, 

25 - introduction, into the said wells, of increasing dilutions of serum, or of a different biological 
sample as defined above, which has to be analysed, 
incubation of the microplate, 

introduction, into the wells of the microtitre plate, of labelled antibodies directed against human or 
animal immunoglobulins, these antibodies having been labelled with the aid of an enzyme 
30 selected from those which are capable of hydrolyzing a substrate, thereby modifying the 

absorption of the radiation of the latter, at least at a defined wavelength, for example at 
550 nm, 

detection, by comparison with a control, of the quantity of substrate hydrolyzed. 

The invention also relates to a kit or set for the detection and/or the identification of 
35 bacteria belonging to the species Chlamydia pneumoniae or to an associated microorganism, 
characterized in that it comprises the following components: 
a polypeptide according to the invention, 
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. where appropriate, the reagents for constituting the medium appropriate for the 
immunological or specific reaction, 
the reagents allowing the detection of the antigen-antibody complexes produced by the immuno- 
logical reaction between the polypeptide(s) of the invention and the antibodies wh,ch may be 
5 present in the biological sample, it being possible for these reagents also to carry a label, or to 

be capable of being recognized in turn by a labelled reagent, more particularly in the case 
where the polypeptide according to the invention is not labelled, 
. where appropriate, a reference biological sample (negative control) free of antibodies recognized 
by a polypeptide according to the invention, 
10 - where appropriate, a reference biological sample (positive control) containing a predetermined 
quantity of antibodies recognized by a polypeptide according to the invention. 

According to the invention, the polypeptides, peptides, fusion proteins or other 
derivatives, or analogs thereof encoded by a polynucleotide sequence in SEQ ID No. 1, may be used 
as an immunogen to generate antibodies which immunospecifically bind such an immunogen. Such 
15 antibodies may include, but are not limited to, polyclonal and monoclonal antibodies, humamzed or 
chimeric antibodies, single chain antibodies, Fab fragments, F(ab) 2 fragments, fragments produced by 
a Fab expression library, anti-idiotypic (anti-Id) antibodies, and epitope-binding fragments of any of 
the above In a specific embodiment, the antibody to a polypeptide, peptide or other derivatwe, or 
analog thereof encoded by a polynucleotide sequence in SEQ ID No. 1 is a bispecific antibody (see 
20 generally, e.g. Fanger and Drakeman, 1995, Dru g News and Perspecti.es Si 133-137). Such a 
bispecific antibody is genetically engineered to recognize both (1) an epitope and (2) one of a vanety 
of "trigger" molecules, ,g. Fc receptors on myeloid cells, and CD3 and CD2 on T cells, that have been 
identified as being able to cause a cytotoxic T-cell to destroy a particular target. Such bispecrfic 
antibodies can be prepared either by chemical conjugation, hybridoma, or recombinant molecular 
25 biology techniques known to the skilled artisan. 

Various procedures known in the art may be used for the production of polyclonal 
antibodies to a polypeptide, peptide or other derivative, or analog thereof encoded by a polynucleot.de 
sequence in SEQ ID No. 1. For the production of antibody, various host animals can be immumzed by 
injection with a polypeptide, or peptide or other derivative, or analog thereof, including but not hunted 
30 to rabbits, mice, rats, etc. Various adjuvants, depending on the host species, may be used to mcrease 
the immunological response, including but not limited to Stimu.on™ QS-21 (Aquda 
Biopharmaceuticals, Inc., Framingham, MA), MPL™ (3-O-deacylated monophosphoryl hp,d A; RBI 
InnnunoChem Research, Inc., Hamilton, MT), aluminum phosphate, 1L-12 (Genetics Institute, 
Cambridge, MA), Freunds (complete and incomplete), mineral gels such as aluminum hydrox.de, 
35 surface active substances such as lysolecithin, pluronic po.yols, polygons, peptides, o,l emu.stons, 
keyhole limpet hemocyanins, dinitrophenol, BCG (bacille Ca.mette-Guerin), and corynebactenum 
parvum Alternatively, polyclonal antibodies may be prepared by purifying, on an affinity column 
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onto which a polypeptide according to the invention has been previously attached, the 
antibodies contained in the serum of patients infected with a bacterium belonging to the species 
Chlamydia pneumoniae. 

For preparation of monoclonal antibodies directed toward a polypeptide, peptide or other 
5 derivative, or analog, any technique which provides for the production of antibody molecules by 
continuous cell lines in culture may be used. For example, the hybridoma technique originally 
developed by Kohler and Milstein (1975, Nature 256 :495-497), as well as the trioma technique, the 
human B-cell hybridoma technique (Kozbor et al. 9 1983, Immunology Today 4:72), and the EBV- 
hybridoma technique to produce human monoclonal antibodies (Cole et ai, 1985, in Monoclonal 

10 Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96). In an additional embodiment of the 
invention, monoclonal antibodies can be produced in germ-free animals utilizing technology described 
in PCT/US 90/02545. In another embodiment of the invention, transgenic non-human animals can be 
used for the production of human antibodies utilizing technology described in WO 98/24893 and WO 
96/33735. According to the invention, human antibodies may be used and can be obtained by using 

15 human hybridomas (Cote et ai, 1983, Proc. Natl. Acad. Sci. U.S.A. 80:2026-2030) or by transforming 
human B cells with EBV virus in vitro (Cole et aL y 1985, in Monoclonal Antibodies and Cancer 
Therapy , Alan R. Liss, pp. 77-96). In fact, according to the invention, techniques developed for the 
production of "chimeric antibodies" (Morrison et ai, 1984, PROC. NATL. ACAD. SCI. U.S.A. 
81:6851-6855; Neuberger et ai, 1984, Nature 312:604-608; Takeda et ai, 1985, Nature 314:452-454) 

20 by splicing the genes from a mouse antibody molecule specific for a polypeptide, peptide or other 
derivative, or analog together with genes from a human antibody molecule of appropriate biological 
activity can be used; such antibodies are within the scope of this invention. 

According to the invention, techniques described for the production of single chain 
antibodies (U.S. Patent 4,946,778) can be adapted to produce polypeptide or peptide-specific single 

25 chain antibodies. An additional embodiment of the invention utilizes the techniques described for the 
construction of Fab expression libraries (Huse et ai, 1989, Science 246:1275-1281) to allow rapid and 
easy identification of monoclonal Fab fragments with the desired specificity for polypeptides, 
derivatives, or analogs. 

Antibody fragments which contain the idiotype of the molecule can be generated by 

30 known techniques. For example, such fragments include but are not limited to: the F(ab')2 fragment 
which can be produced by pepsin digestion of the antibody molecule; the Fab' fragments which can be 
generated by reducing the disulfide bridges of the F(ab')2 fragment, the Fab fragments which can be 
generated by treating the antibody molecule with papain and a reducing agent, and Fv fragments. 

In addition, techniques have been developed for the production of chimerized {See, e.g. , 

35 Boss, M. et al., U.S. Patent No. 4,816,397; and Cabilly, S. et a!., U.S. Patent No. 5,585,089 each of 
which is incorporated herein by reference in its entirety) humanized antibodies {See, e.g., Queen, U.S. 
Patent No. 5,585,089, which is incorporated herein by reference in its entirety.) An immunoglobulin 



PCT7IB98/01890 

WO 99/27105 

62 

light or heavy chain variable region consists of a "framework" region interrupted by three 
hypervariable regions, referred to as complementarily determining regions (CDRs). The extent of the 
framework region and CDRs have been precisely defined ( See, "Sequences of Proteins of 
Immunological Interest", Kabat, E. et al., U.S. Department of Health and Human Services (1983). 
5 Briefly, humanized antibodies are antibody molecules from non-human species having one or more 
CDRs from the non-human species and a framework from a human immunoglobulin molecule. 

The antibodies of the invention may also be labelled in the same manner as described 
above for the nucleic probes of the invention such as an enzymatic, fluorescent or radioactive type 
labelling. 

10 The invention relates, in addition, to a method for the detection and/or the identification 

of bacteria belonging to the species Chlamydia pneumoniae or to an associated microorganism in a 
biological sample, characterized in that it comprises the following steps: 

a) bringing the biological sample (biological tissue or fluid) into contact with a mono- or polyclonal 

antibody according to the invention (under conditions allowing an immunological reaction 
15 between the said antibodies and the polypeptides of the bacterium belonging to the species 

Chlamydia pneumoniae or to an associated microorganism which may be present in the 
biological sample, that is, under conditions suitable for the formation of immune 
complexes); 

b) detecting the antigen-antibody complex which may be formed. 

20 Also falling within the scope of the invention is a kit or set for the detection and/or the 

identification of bacteria belonging to the species Chlamydia pneumoniae or to an associated 
microorganism, characterized in that it comprises the following components: 

- a polyclonal or monoclonal antibody according to the invention, labeled where appropriate; 

- where appropriate, a reagent for constituting the medium appropriate for carrying out the 
25 immunological reaction; 

a reagent allowing the detection of the antigen-antibody complexes produced by the 
immunological reaction, it being possible for this reagent also to carry a label, or to be 
capable of being recognized in turn by a labelled reagent, more particularly in the case where 
the said monoclonal or polyclonal antibody is not labelled; 
3 0 - where appropriate, reagents for carrying out the lysis of the cells in the sample tested. 

The principle of the DNA chip which was explained above may also be used to produce 
protein "chips" on which the support has been coated with a polypeptide or an antibody according to 
the invention, or arrays thereof, in place of the DNA. These protein "chips" ™ke it possible, for 
example, to analyze the biomolecular interactions (B1A) induced by the affinity capture of target 
35 analytes onto a support coated, for example, with proteins, by surface plasma resonance (SPR). 
Reference may be made, for example, to the techniques for coupling proteins onto a solid support 
which are described in EP524 800 or to the methods describing the use of biosensor-type protein 
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chips such as the BIAcore-type technique (Pharmacia) (Arlinghaus et al., 1997, Krone et al., 
1997, Chatelier et al., 1995). These polypeptides or antibodies according to the invention, capable of 
specifically binding antibodies or polypeptides derived from the sample to be analysed, may thus be 
used in protein chips for the detection and/or the identification of proteins in samples. The said protein 
5 chips may in particular be used for infectious diagnosis and may preferably contain, per chip, several 
polypeptides and/or antibodies of the invention of different specificity, and/or polypeptides and/or 
antibodies capable of recognizing microorganisms different from Chlamydia pneumoniae. 

Accordingly, the subject of the present invention is also the polypeptides and the 
antibodies according to the invention, characterized in that they are immobilized on a support, in 
1 0 particular of a protein chip. 

The protein chips, characterized in that they contain at least one polypeptide or one 
antibody according to the invention immobilized on the support of the said chip, also form part of the 
invention. 

The invention comprises, in addition, a protein chip according to the invention, 
15 characterized in that it contains, in addition, at least one polypeptide of a microorganism different 
from Chlamydia pneumoniae or at least one antibody directed against a compound of a microorganism 
different from Chlamydia pneumoniae, immobilized on the support of the said chip. 

The invention also relates to a kit or set for the detection and/or the identification of 
bacteria belonging to the species Chlamydia pneumoniae or to an associated microorganism, or for the 
20 detection and/or the identification of a microorganism characterized in that it comprises a protein chip 
according to the invention. 

The subject of the present invention is also a method for the detection and/or the 
identification of bacteria belonging to the species Chlamydia pneumoniae or to an associated 
microorganism in a biological sample, characterized in that it uses a nucleotide sequence according to 
25 the invention. 

More particularly, the invention relates to a method for the detection and/or the 
identification of bacteria belonging to the species Chlamydia pneumoniae or to an associated 
microorganism in a biological sample, characterized in that it comprises the following steps: 

a) where appropriate, isolation of the DNA from the biological sample to be analysed, or optionally 
3 0 production of a cDNA from the RNA in the biological sample; 

b) specific amplification of the DNA of bacteria belonging to the species Chlamydia pneumoniae or 

to an associated microorganism with the aid of at least one primer according to the 
invention; 

c) detection of the amplification products. 

35 These may be detected, for example, by the molecular hybridization technique using a 

nucleic probe according to the invention. This probe will be advantageously labelled with a 
nonradioactive (cold probe) or radioactive element. 
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For the purposes of the present invention, "DNA in the biological sample" or 
"DNA contained in the biological sample" will be understood to mean either the DNA present in the 
biological sample considered, or optionally the cDNA obtained after the action of a reverse 
transcriptase-type enzyme on the RNA present in the said biological sample. 
5 Another aim of the present invention consists in a method according to the invention, 

characterized in that it comprises the following steps: 

a) bringing a nucleotide probe according to the invention into contact with a biological sample, the 

DNA contained in the biological sample having, where appropriate, been previously made 
accessible to hybridization, under conditions allowing the hybridization of the probe to 
10 complementary base pairs of the DNA of a bacterium belonging to the species Chlamydia 

pneumoniae or to an associated microorganism; 

b) detecting the hybridization complex formed between the nucleotide probe and the DNA in the 

biological sample. 

The present invention also relates to a method according to the invention, characterized in 

1 5 that it comprises the following steps: 

a) bringing a nucleotide probe immobilized on a support according to the invention into contact 
with a biological sample, the DNA in the sample having, where appropriate, been previously 
made accessible to hybridization, under conditions allowing the hybridization of the probe to 
the DNA of a bacterium belonging to the species Chlamydia pneumoniae or to an associated 

20 microorganism; 
" b) bringing the hybrid formed between the nucleotide probe immobilized on a support and the DNA 

contained in the biological sample, where appropriate after removal of the DNA in the 
biological sample which has not hybridized with the probe, into contact with a labelled 
nucleotide probe according to the invention; 
25 c) detecting the new hybrid formed in step b). 

According to an advantageous embodiment of the method for the detection and/or the 
identification defined above, it is characterized in that, prior to step a), the DNA in the biological 
sample is primer-extended and/or amplified beforehand with the aid of at least one primer accordmg to 
the invention. 

30 The invention relates, in addition, to a kit or set for the detection and/or the identificatton 

of bacteria belonging to the species Chlamydia pneumoniae or to an associated microorganism, 
characterized in that it comprises the following components: 

a) a nucleotide probe according to the invention; 

b) where appropriate, the reagents necessary for carrying out a hybridization reaction; 

35 c) where appropriate, at least one primer according to the invention as well as the reagents (e^, 
polymerase and/or deoxynucleotide triphosphates) necessary for a DNA amplification 
reaction. 
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The invention also relates to a kit or set for the detection and/or the identification of 
bacteria belonging to the species Chlamydia pneumoniae or to an associated microorganism, 
characterized in that it comprises the following components: 
a) a nucleotide probe, called capture probe, according to the invention; 
5 b) an oligonucleotide probe, called detection probe, according to the invention; 

c) where appropriate, at least one primer according to the invention as well as the reagents ( e.g. , 
polymerase and/or deoxynucleotide triphosphates) necessary for a DNA amplification 
reaction. 

The invention also relates to a kit or set for the detection and/or the identification of 
10 bacteria belonging to the species Chlamydia pneumoniae or to an associated microorganism, 
characterized in that it comprises the following components: 

a) at least one primer according to the invention; 

b) where appropriate, the reagents necessary for carrying out a DNA amplification reaction; 

c) where appropriate, a component which makes it possible to check the sequence of the amplified 
15 fragment, more particularly an oligonucleotide probe according to the invention. 

The invention relates, in addition, to a kit or set for the detection and/or the identification 
of bacteria belonging to the species Chlamydia pneumoniae or to an associated microorganism, or for 
the detection and/or the identification of a microorganism characterized in that it comprises a DNA 
chip according to the invention. 

20 The invention also relates to a method or to a kit or set according to the invention for the 

detection and/or the identification of bacteria belonging to the species Chlamydia pneumoniae, 
characterized in that the said primer and/or the said probe according to the invention are chosen from 
the nucleotide sequences specific to the species Chlamydia pneumoniae, in that the said polypeptides 
according to the invention are chosen from the polypeptides specific to the species Chlamydia 

25 pneumoniae and in that the said antibodies according to the invention are chosen from the antibodies 
directed against the polypeptides according to the invention chosen from the polypeptides specific to 
the species Chlamydia pneumoniae. 

Preferably, the said method or the said kit or set above according to the invention, for the 
detection and/or the identification of bacteria belonging to the species Chlamydia pneumoniae is 

3 0 characterized in that the said primer and/or the said probe or the said polypeptides are chosen from the 
nucleotide sequences or polypeptides according to the invention which have been identified as being 
specific to the species Chlamydia pneumoniae and in that the said antibodies according to the 
invention are chosen from the antibodies directed against the polypeptides according to the invention 
chosen from the polypeptides identified as being specific to the species Chlamydia pneumoniae. 

35 The invention relates, in addition, to a method or a kit or set according to the invention 

for the diagnosis of predispositions to, or of a condition caused by, cardiovascular diseases, preferably 
linked to the presence of atheroma, which are induced or worsened by a Chlamydia pneumoniae 
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^ The invention also relates to a method or a kit or set according to the invention for the 

diagnosis of predispositions to, or of conditions caused by, respiratory diseases induced or worsened 

According to another aspect, the subject of the invention is the use of po.ypept.des 
according to the invention, of cells transformed with a vector according to the invention and/or of 
transformed animals according to the invention, for the biosynthesis or the biodegradation of organ, 

or inorganic compounds. 

As has been mentioned above, the nucleotide sequences of the invention were .denttfied 
10 by homology with sequences known to encode, for example, polypeptides or fragments of enzymaUc 
polypeptides involved in the biosynthesis or the biodegradation of organic or inorgamc molecule. 

It is thus possible to use the said polypeptides of the invention in a similar manner for the 
biosynthesis or the biodegradation of organic or inorganic compounds of industrial or therapeutK 
interest (called compounds of interest). 

A^ong .hese polypepudes, *« ™» <* » *" ^ '"^ 

such as .he peo,eo,y,ic ^» -*> — — *— — « * <T" 
which may he use. in d> biosymhesis of sugaas, amino acids, My acid,, pofypepadea, nucleo-das, 
1* JL « any on*, organic « inoiganie compound . in ,h= biodegoadarion of cugan.0 0, 

./^"iC-po.yppp.-.---— 

e„ wm =s eoaresponding » muua,cd o, - *M»> — *• » - ^ 
to be used for -he biosymhesis « .he biodegmdauon of organic o, inorganic compounds d.e 
industrial fevel, such as, for *« P— of compounds of inrerear, .he rep,oc=ss,ng of 

manufacruring maidues appiied .. food iudusuies, ,o dae papormahang indusriy o, ,o .he chonuod 

25 and pharmaceutical industries. 

The methods of biosynthesis or biodegradation of organic or morgan, compounds, 
characterized in that they use a polypeptide or one of its representative fragments according to the 
invention, transformed cells according to the invention and/or a transformed animal accordmg to the 

invention, also form part of the invention. 
30 The invention relates, in addition, to the use of a nucleotide sequence accordmg to the 

invention, of a polypeptide according to the invention, of an antibody according to the invention, of a 
eel, according to the invention, and/or of a transformed animal according to the mvenfon, for the 
selection of an organic or inorganic compound capable of modulating, regulating, mducmg or 
inhibiting the expression of genes, and/or of modifying the cellular replication of eukaryouc or 
35 prokaryotic cells or capable of inducing, inhibiting or worsening the pathologies linked to an infecUon 
by Chlamydia pneumoniae or one of its associated microorganisms. 

The invention also comprises screening assays that comprise methods of selecting 
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compounds capable of binding to a polypeptide, fusion polypeptide or one of its representative 
fragments according to the invention, capable of binding to a nucleotide sequence according to the 
invention, or capable of recognizing an antibody according to the invention, and/or capable of 
modulating, regulating, inducing or inhibiting the expression of genes, and/or of modifying the growth 
5 or the cellular replication of eukaryotic or prokaryotic cells, or capable of inducing, inhibiting or 
worsening, in an animal or human organism, the pathologies linked to an infection by Chlamydia 
pneumoniae or one of its associated microorganisms, characterized in that it comprises the following 
steps: 

a) bringing the said compound into contact with the said polypeptide, the said nucleotide 
10 sequence, with a transformed cell according to the invention and/or administering the said compound 

to a transformed animal according to the invention; 

b) determining the capacity of the said compound to bind with the said polypeptide or the 
said nucleotide sequence, or to modulate, regulate, induce or inhibit the expression of genes, or to 
modulate growth or cellular replication, or to induce, inhibit or worsen in the said transformed animal, 

15 the pathologies linked to an infection by Chlamydia pneumoniae or one of its associated 
microorganisms. 

The transformed cells and/or animals according to the invention may advantageously 
serve as a model and may be used in methods for studying, identifying and/or selecting compounds 
capable of being responsible for pathologies induced or worsened by Chlamydia pneumoniae, or 

20 capable of preventing and/or of treating these pathologies such as, for example, cardiovascular or 
respiratory diseases. In particular, the transformed host cells, in particular bacteria of the Chlamydia 
family whose transformation with a vector according to the invention may, for example, increase or 
inhibit its infectivity, or modulate the pathologies usually induced or worsened by the infection, may 
be used to infect animals in which the onset of pathologies will be monitored. These nontransformed 

25 animals, infected for example with transformed Chlamydia bacteria, may serve as a study model. In 
the same manner, the transformed animals according to the invention may, for example, exhibit 
predispositions to cardiovascular and/or respiratory diseases and thus be used in methods for selecting 
compounds capable of preventing and/or of treating the said diseases. The said methods using the said 
transformed cells and/or transformed animals form part of the invention. 

30 The compounds capable of being selected may be organic compounds such as 

polypeptides or carbohydrates or any other organic or inorganic compounds already known, or new 
organic compounds produced using molecular modeling techniques and obtained by chemical or 
biochemical synthesis, these techniques being known to persons skilled in the art. 

The said selected compounds may be used to modulate the growth and/or the cellular 

35 replication of Chlamydia pneumoniae or any other associated microorganism and thus to control 
infection by these microorganisms. The said compounds according to the invention may also be used 
to modulate the growth and/or the cellular replication of all eukaryotic or prokaryotic cells, in 
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particular tumour cells and infectious microorganisms, for which the said compounds 
wi,, prove active, the methods which make it possible to determine the said modulations bemg well 

known to persons skilled in the art. 

Compound capable of modulating the growth of a microorganism is understood to 
5 designate any compound which makes it possible to act, to modify, to limit and/or to reduce the 
development, the growth, the rate of proliferation and/or the viability of the said microorgan 1S m. 

This modulation may be achieved, for example, by an agent capable of bmdmg to a 
protein and thus of inhibiting or of potentiating its biological activity, or capable of binding to a 
me mbrane protein of the outer surface of a microorganism and of blocking the penetration of the sa,d 
10 microorganism into the host cell or of promoting the action of the immune system of the infected 
organism directed against the said microorganism. This modulation may also be achieved by an agent 
capable of binding to a nucleotide sequence of a DNA or RNA of a microorganism and of b.ockmg, 
for example, the expression of a polypeptide whose biological or structural activity is necessary for the 
growth or for the reproduction of the said microorganism. 

Associated microorganism is understood to designate in the present invention any 
nucroorganism whose gene expression may be modulated, regulated, induced or inhibited, or whose 
growth or cellular replication may also be modulated by a compound of the invention. Assorted 
nucroorganism is also understood to designate in the present invention any microorganism contammg 
nucleotide sequences or polypeptides according to the invention. These microorganisms may. m some 
20 cases, contain polypeptides or nucleotide sequences identical or homologous to those of the invents 
roay also be detected and/or identified by the detection and/or identification methods or kit according 
to the invention and may also serve as a target for the compounds of the invention. 

The invention relates to the compounds capable of being selected by a method of 

selection according to the invention. 
25 The invention also relates to a pharmaceutical composition comprising a compound 

chosen from the following compounds: 
a nucleotide sequence according to the invention; 
a polypeptide according to the invention; 
a vector according to the invention; 
30 an antibody according to the invention; and 

ompound capable of being selected by a method of selection according to the invention, opuonal.y 
in combination with a pharmaceuticaUy acceptable vehicle. 
An effective quantity is understood to designate a sufficient quantity of the saad 
ompound or antibody, or of a polypeptide of the invention, which makes it possible to modulate the 
3 5 growth of Chlamydia pneumoniae or of an associated microorganism. 

The invention also relates to a pharmaceutical composition comprising one or more 
polypeptides according to the invention and/or one or more fusion polypeptides according to the 
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invention. Such compositions further comprise a pharmaceutically acceptable carrier or vehicle. 
Pharmaceutical compositions include compositions that comprise a polypeptide or fusion polypeptide 
that immunoreacts with seropositive serum of an individual infected with Chlamydia pneumoniae. In 
one embodiment, a pharmaceutical composition according to the invention can be utilized for the 
5 prevention or the treatment of an infection by a bacterium belonging to the species Chlamydia 
pneumoniae or by an associated microorganism. 

The invention relates, in addition, to an immunogenic composition or a vaccine 
composition, characterized in that it comprises one or more polypeptides according to the invention 
and/or one or more hybrid (fusion) polypeptides according to the invention. Such compositions 
10 further comprise a pharmaceutically acceptable carrier or vehicle. Immunogenic compositions or 
fusion polypeptide include compositions that comprise a polypeptide that immunoreacts with 
seropositive serum of an individual infected with Chlamydia pneumoniae. 

Immunogenic or vaccine compositions can also comprise DNA immunogenic or vaccine 
compositions comprising polynucleotide sequences of the invention operatively associated with a 
15 regulatory sequence that controls gene expression. Such compositions can include compositions that 
direct expression of a neutralizing epitope of Chlamydia pneumoniae. 

The invention also comprises the use of a transformed cell according to the invention, for 
the preparation of a vaccine composition. 

The invention also relates to a vaccine composition, characterized in that it contains a 
20 nucleotide sequence according to the invention, a vector according to the invention and/or a 
transformed cell according to the invention. 

The invention also relates to the vaccine compositions according to the invention, for the 
prevention or the treatment of an infection by a bacterium belonging to the species Chlamydia 
pneumoniae or by an associated microorganism. 
25 The invention also relates to the use of DNA encoding polypeptides of Chlamydia 

pneumoniae, in particular antigenic determinants, to be formulated as vaccine compositions. In 
accordance with this aspect of the invention, the DNA of interest is engineered into an expression 
vector under the control of regulatory elements, which will promote expression of the DNA, i.e., 
promoter or enhancer elements. In one preferred embodiment, the promoter element may be cell- 
30 specific and permit substantial transcription of the DNA only in predetermined cells. The DNA may 
be introduced directly into the host either as naked DNA (U.S. Patent No. 5,679,647 incorporated 
herein by reference in their entirety) or formulated in compositions with other agents which may 
facilitate uptake of the DNA including viral vectors, i.e., adenovirus vectors, or agents which facilitate 
immunization, such as bupivicaine and other local anesthetics (U.S. Patent 5,593,972 incorporated 
35 herein by reference in their entirety), saponins (U.S. Patent 5,739,118 incorporated herein by 
reference in their entirety) and cationic polyamines (published international application WO 96/10038 
incorporated herein by reference in their entirety). 
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The DNA sequence encoding the antigenic polypeptide and regulatory element may 
be inserted into a stable cell line or cloned microorganism, using techniques, such as targeted 
homologous recombination, which are well known to those of skill in the art, and described e.g., in 
Chappel, US. Patent No. 4,215,051; Skoultchi, WO 91/06667 each of which is incorporated herein by 

5 reference in its entirety. 

Such cell lines and microorganisms may be formulated for vaccine purposes. In yet 
another embodiment, the DNA sequence encoding the antigenic polypeptide and regulatory element 
ma y be delivered to a mammalian host and introduced into the host genome via homologous 
recombination (See. Chappel, U.S. Patent No. 4,215,051; Skoultchi, WO 91/06667 each of which is 
1 0 incorporated herein by reference in its entirety. 

Preferably, the immunogenic and/or vaccine compositions according to the invention 
intended for the prevention and/or the treatment of an infection by Chlamydia pneumoniae or by an 
associated microorganism will be chosen from the immunogenic and/or vaccine composes 
comprising a polypeptide or one of its representative fragments corresponding to a protein, or one of 
15 its representative fragments, of the cellular envelope of Chlamydia pneumoniae. The vaccine 
compositions comprising nucleotide sequences will also preferably comprise nucleotide sequences 
encoding a polypeptide or one of its representative fragments corresponding to a protem, or one of .ts 
representative fragments, of the cellular envelope of Chlamydia pneumoniae. 

Among these preferred immunogenic and/or vaccine compositions, the most preferred are 
2 0 those comprising a polypeptide or one of its representative fragments, or a nucleotide sequence or one 
of its representative fragments whose sequences are chosen from the nucleotide or ammo acid 
sequences identified in this functional group and listed above. 

The polypeptides of the invention or their representative fragments entering into the 
immunogenic compositions according to the invention may be selected by techniques known to 
25 persons skilled in the art, such as for example on the capacity of the said polypeptides to sumulate T 
cells, which results, for example, in their proliferation or the secretion of interleukins, and which leads 
to the production of antibodies directed against the said polypeptides. 

In mice, in which a weight dose of the vaccine composition comparable to the dose used 
in humans is administered, the antibody reaction is tested by collecting serum followed by a study of 
3 0 the formation of a complex between the antibodies present in the serum and the antigen of the vaccine 
composition, according to the customary techniques. 

According to the invention, the said vaccine compositions will be preferably in 
combination with a pharmaceutical* acceptable vehicle and, where appropriate, with one or more 

appropriate immunity adjuvants. 
3 5 Various types of vaccines are currently available for protecting humans against infectious 

diseases: attenuated live microorganisms (M. bovis - BCG for tuberculosis), inactivated 
microorganisms (influenza virus), acellular extracts (Bordetella pertussis for whooping cough), 
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recombinant proteins (hepatitis B virus surface antigen), polysaccharides (pneumococci). 
Experiments are underway on vaccines prepared from synthetic peptides or from genetically modified 
microorganisms expressing heterologous antigens. Even more recently, recombinant plasmid DNAs 
carrying genes encoding protective antigens were proposed as an alternative vaccine strategy. This 
5 type of vaccination is carried out with a particular plasmid derived from an E. coli plasmid which does 
not replicate in vivo and which encodes only the vaccinal protein. Animals were immunized by simply 
injecting the naked plasmid DNA into the muscle. This technique leads to the expression of the 
vaccine protein in situ and to a cell-type (CTL) and a humoral type (antibody) immune response. This 
double induction of the immune response is one of the main advantages of the technique of 

1 0 vaccination with naked DNA. 

The vaccine compositions of the present invention can be evaluated in in vitro and in vivo 
animal models prior to host, e.g. , human, administration. For example, in vitro neutralization assays 
such as those described by Peterson et al. (1988) can be utilized. The assay described by Peterson et 
al. (1988) is suitable for testing vaccine compositions directed toward either Chlamydia pneumoniae 

15 or Chlamydia trachomatis. 

Briefly, hyper-immune antisera is diluted in PBS containing 5% guinea pig serum, as a 
complement source. Chlamydiae (10 4 IFU; infectious units) are added to the antisera dilutions. The 
antigen-antibody mixtures are incubated at 37EC for 45 minutes and inoculated into duplicate 
confluent Hep-2 or HeLa cell monolayers contained in glass vials (e.g. , 15 by 45 mm), which have 

20 been washed twice with PBS prior to inoculation. The monolayer cells are infected by centrifugation 
at 1000X g for 1 hour followed by stationary incubation at 37E for 1 hour. Infected monolayers are 
incubated for 48 or 72 hours, fixed and stained with a Chlamydiae specific antibody, such as anti- 
MOMP for C. trachomatis, etc. IFUs are counted in ten fields at a magnification of 200X. 
Neutralization titer is assigned based on the dilution that gives 50% inhibition as compared to control 

25 monolayers/IFU. 

The efficacy of vaccine compositions can be determined in vivo by challenging animal 
models of Chlamydia pneumoniae infection, e^, mice or rabbits, with the vaccine compositions. For 
example, in vivo vaccine composition challenge studies can be performed in the murine model of 
Chlamydia pneumonia infection described by Moazed et al. (1997). Briefly, male homozygous apoE 

30 deficient and/or C57 BL/6J mice are immunized with vaccine compositions. Post-vaccination, the 
mice are mildly sedated by subcutaneous injection of a mixture of ketamine and xylazine, and 
inoculated intranasally with a total volume of 0.03-0.05 ml of organisms suspended in SPG medium or 
with SPG alone. The inoculations of Chlamydia pneumoniae are approximately 3xl0 7 IFU/mouse. 
The mice are inoculated with Chlamydia pneumoniae at 8, 10, and 12 weeks of age. Tissues are then 

35 collected from the lung, spleen, heart, etc. at 1-20 weeks after the first inoculation. The presence of 
organisms is scored using PCR, histology and immunocytochemistry, or by quantitative culture/IFU 
after tissue homogenization. 
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Alternatively, in vivo vaccine composition challenge studies can be performed 
in the rabbit mode, of Chlamydia pneumoniae described by Laitinen et al. (1997). Briefly, New 
Zealand white rabbits (5 months old) are immunized with the vaccine compositions. Post-vaccination, 
the rabbits are sedated with Hypnorm, 0.3 ml/Kg of body weight, intramuscularly, and ovulated 
5 intranasal with a total of 0.5 ml of Chlamydia pneumoniae suspended in SPG medium or with SPG 
alone The inoculations of Chlamydia pneumoniae are approximately 3xltf FU/rabbit. The rabbits 
are reinfected in the same manner and with the same dose 3 weeks after the primary inoculation. 
Tissues are then collected 2 weeks after the primary infection and 1, 2, and 4 weeks after the 
reinfection. The presence of Chlamydia pneumoniae is scored using PGR, histology and 
10 immunocytochemistry, or by quantitative culture/IFU after tissue homogenization. 

The vaccine compositions comprising nucleotide sequences or vectors into which the said 
sequences are inserted are in particular described in International Application No. WO 90/11092 and 
also in International Application No. WO 95/1 1307. 

The nucleotide sequence constituting the vaccine composition according to the invention 
15 may be injected into the host after having been coupled to compounds which promote the penetration 
of this polynucleotide inside the cell or its transport up to the cell nucleus. The resulting conjugates 
ma y be encapsulated into polymeric microparticles, as described in International Application No. 
WO 94/27238 (Medisorb Technologies International). 

According to another embodiment of the vaccine composition according to the invention, 
20 the nucleotide sequence, prefembly a DNA, is complexed with the DEAE-dextran (Pagano et al 
,967) or with nuclear proteins (Kaneda et al., 1989), with lipids (Feigner et al., 1987) or encapsulated 
into liposomes (Fraley et al., 1980) or alternatively introduced in the form of a gel facilitating its 
transfection into the cells (Midoux et al., 1993, Pastore et al., 1994). The polynucleotide or the vector 
according to the invention may also be in suspension in a buffer solution or may be combined with 

25 liposomes. . 

Advantageously, such a vaccine will be prepared in accordance with the technique 

described by Tacson et al. or Huygen et al. in 1996 or alternatively in accordance with the technique 
described by Davis et al. in International Application No. WO 95/1 1307. 

Such a vaccine may also be prepared in the form of a composition containing a vector 
30 according to die invention, placed under the control of regulatory elements allowing its expression n, 
humans or animals. It is possible, for example, to use, as vector for the in vivo expression of the 
polypeptide antigen of interest, the plasmid P cDNA3 or the plasmid pcDNAl/neo, both marketed by 
Invitrogen ® & D Systems, Abingdon, United Kingdom). It is also possible to use the plasmid 
VlJnstPA, described by Shiver et al. in 1995. Such a vaccine will advantageously comprise, in 
3 5 addition to the recombinant vector, a saline solution, for example a sodium chloride solution. 

The immunogenic compositions of the invention can also be utilized as part of methods 
for immunization, wherein such methods comprise administering to a host, e^, a human host, an 
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immunizing amount of the immunogenic compositions of the invention. In a preferred 
embodiment, the method of immunizing is a method of immunizing against Chlamydia pneumoniae. 

A pharmaceutical^ acceptable vehicle is understood to designate a compound or a 
combination of compounds entering into a pharmaceutical or vaccine composition which does not 
5 cause side effects and which makes it possible, for example, to facilitate the administration of the 
active compound, to increase its life and/or its efficacy in the body, to increase its solubility in solution 
or alternatively to enhance its preservation. These pharmaceutically acceptable vehicles are well 
known and will be adapted by persons skilled in the art according to the nature and the mode of 
administration of the active compound chosen. 

10 As regards the vaccine formulations, these may comprise appropriate immunity adjuvants 

which are known to persons skilled in the art, such as, for example, aluminum hydroxide, a 
representative of the family of muramyl peptides such as one of the peptide derivatives of N-acetyl- 
muramyl, a bacterial lysate, or alternatively incomplete Freund's adjuvant, Stimulon™ QS-21 (Aquila 
Biopharmaceuticals, Inc., Framingham, MA), MPL™ (3-O-deacylated monophosphoryl lipid A; RIBI 

15 ImmunoChem Research, Inc., Hamilton, MT), aluminum phosphate, IL-12 (Genetics Institute, 
Cambridge, MA). 

Preferably, these compounds will be administered by the systemic route, in particular by 
the intravenous route, by the intranasal, intramuscular, intradermal or subcutaneous route, or by the 
oral route. More preferably, the vaccine composition comprising polypeptides according to the 
20 invention will be administered several times, spread out over time, by the intradermal or subcutaneous 
route. 

Their optimum modes of administration, dosages and galenic forms may be determined 
according to criteria which are generally taken into account in establishing a treatment adapted to a 
patient, such as for example the patient's age or body weight, the seriousness of his general condition, 
2 5 tolerance of the treatment and the side effects observed. 

The invention comprises the use of a composition according to the invention for the 
treatment or the prevention of cardiovascular diseases, preferably linked to the presence of atheroma, 
which are induced or worsened by Chlamydia pneumoniae. 

Finally, the invention comprises the use of a composition according to the invention for 
30 the treatment or the prevention of respiratory diseases which are induced or worsened by the presence 
of Chlamydia pneumoniae, preferably asthma. 

Other characteristics and advantages of the invention appear in the following examples 

and figures: 
35 Legend to the figures : 



Figure 1 : Line for the production of Chlamydia pneumoniae sequences 
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Figure 2 : Analysis of the sequences and assembling 
Figure 3 : Finishing techniques 

Figure 3a) : Assembly map 

Figure 3b) : Determination and use of the orphan ends of the contigs 



5 



EXAMPLES 

Experimental procedures 

10 

Cells 

The Chlamydia pneumoniae strain (CM1) used by the inventors is obtained from ATCC 
(American Culture Type Collection) where it has the reference number ATCC 1360-VR. 

It is cultured on HeLa 229 cells, obtained from the American Type Culture Collection, 

1 5 under the reference ATCC CCL-2. 1 . 
Culture of the cells 

The HeLa ATCC CCL-2. 1 cells are cultured in 75-ml cell culture flasks (Corning). The 
culture medium is Dulbecco's modified cell culture medium (Gibco BRL No. 04101965) 
supplemented with MEM amino acids (Gibco BRL - No. 04301 140) L (5 ml per 500 ml of medium) 
20 and 5% foetal calf serum (Gibco BRL No. 10270 batch 40G8260K) without antibiotics or antifungals. 

The cell culture stock is maintained in the following manner. The cell cultures are 
examined under an inverted microscope. 24 hours after confluence, each cellular lawn is washed with 
PBS (Gibco BRL No. 041 14190), rinsed and then placed for 5 min in an oven in the presence of 3 ml 
of trypsine (Gibco BRL No. 25200056). The cellular lawn is then detached and then resuspended in 
25 120 ml of culture medium, the whole is stirred in order to make the cellular suspension homogeneous. 
30 ml of this suspension are then distributed per cell culture flask. The flasks are kept in a CO, oven 
(5%) for 48 hours at a temperature of 37°C. The cell stock is maintained so as to have available daily 
16 flasks of subconfluent cells. It is these subconfluent cells which wUl be used so as to be infected 
with Chlamydia. 25-ml cell culture flasks are also used, these flasks are prepared in a similar manner 
30 but the volumes used for maintaining the cells are the following: 1 ml of trypsine, 28 ml of culture 
medium to resuspend the cells, 7 ml of culture medium are used per 25-ml flask. 
TnfArtmn of the cells with Chlamydia 

Initially, the Chlamydiae are obtained frozen from ATCC (-70°C), in suspension in a 
volume of 1 ml. This preparation is slowly thawed, 500 ul are collected and brought into contact with 
35 subconfluent cells, which are obtained as indicated above, in a 25-ml cell culture flask, containing 
1 ml of medium, so as to cover the cells. The flask is then centrifuged at 2000 rpm in a "swing" rotor 
for microtitre plates, the centrifuge being maintained at a temperature of 35°C. After centrimgation, 
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the two flasks are placed in an oven at 35°C for three hours. 6 ml of culture medium containing 
cyclohexirnide (1 H,g/ml) are then added and the flask is stored at 35°C. After 72 hours, the level of 
infection is evaluated by direct immunofluorescence and by the cytopathogenic effect caused to the 
cells. 

5 Direct immunofluorescence 

Starting with infected cells, which were obtained as indicated above, a cellular smear is 
deposited with a Pasteur pipette on a microscope slide. The cellular smear is fixed with acetone for 
10 minutes; after draining the acetone, the smear is covered with 30 \i\ of murine monoclonal 
antibodies directed against MOMP (major outer membrane protein) of Chlamydia (Syva, Biomerieux) 
10 labelled with fluorescein isothiocyanate. The whole is then incubated in a humid chamber at a 
temperature of 37°C. The slides are then rinsed with water, slightly dried, and then after depositing a 
drop of mounting medium, a coverslip is mounted before reading. The reading is carried out with the 
aid of a fluorescence microscope equipped with the required filters (excitation at 490 nm, emission at 
520 nm). 

15 Harvesting of the Chlamydia pneumoniae 

After checking the infection by direct immunofluorescence, carried out as indicated 
above, the culture flasks are opened under a sterile cabinet, sterile glass beads with a diameter of the 
order of a millimeter are placed in the flask. The flask is closed and then vigorously stirred while 
being maintained horizontally, the cellular lawn at the bottom, so that the glass beads can have a 

20 mechanical action on the cellular lawn. Most of the cells are thus detached or broken; the effect of the 
stirring is observed under an optical microscope so as to ensure proper release of Chlamydiae. 
Large-scale infection of the cell cultures 

The product of the Chlamydiae harvest (culture medium and cellular debris) is collected 
with a pipette, and distributed into three cell culture flasks containing subconfluent HeLa ATCC CCL- 

25 2.1 cells, obtained as indicated above. The cells thus inoculated are placed under gentle stirring 
(swing) in an oven at 35°C. After one hour, the flasks are kept horizontally in an oven so that the 
culture medium covers the cells for 3 hours. 30 ml of culture medium containing actydione (1 M-g/ml) 
are then added to each of the flasks. The culture flasks are then stored at 35°C for 72 hours. The cells 
thus infected are examined under an optical microscope after 24 hours, the cytopathogenic effect is 

30 evaluated by the appearance of cytoplasmic inclusions which are visible under an inverted optical 
microscope. After 72 hours, the vacuoles containing the Chlamydiae occupy the cytoplasm of the cell 
and push the cell nucleus sideways. At this stage, numerous cells are spontaneously destroyed and 
have left free elementary bodies in the culture medium. The Chlamydiae are harvested as described 
above and are either frozen at -80°C or used for another propagation. 

35 Purification of the Chlamydiae 

The product of the Chlamydia harvests is stored at -80°C and thawed on a water bath at 
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room temperature. After thawing, each tube is vigorously stirred for one minute and immersed 
for one minute in an ultrasound tank (BRANSON 1200); the tubes are then stirred by inverUng before 
being centrifuged for 5min at 2000rpm. The supernatant is carefully removed and kept at cold 
temperature (ice). The supernatant is vigorously stirred and then filtered on nylon filters having pores 
5 of 5 microns in diameter on a support (Nalgene) allowing a delicate vacuum to be established under 
the nylon filter. For each filtration, three nylon filters are superposed; these filters are replaced after 
every 40 ml of filtrate. Two hundred milliliters of filtration product are kept at cold temperature, and 
then after stirring by inverting, are centrifuged at 10.000 rpm for 90 min, the supernatant is removed 
and the pellet is taken up in 10 ml of 10 mM Tris, vigorously vortexed and then centnfuged at 
10 10 000 rpm for 90 min. The supernatant is removed and the pellet is taken up in a buffer (20 mM Tns 
pH 8 0, 50 mM KC1, 5 mM MgCl 2 ) to which 800 units of DNAse 1 (Boehringer) are added. The whole 
is kept at 37°C for one hour. One ml of 0.5 M EDTA is then added, the whole is vortexed and frozen 

at-20°C. 

Preparation nf the DNA 

15 ~ The Chlamydiae purified above are thawed and subjected to a proteinase K (Boehringer) 

digestion in a final volume of 10 ml. The digestion conditions are the following: 0.1 mg/ml proteinase 
K 0 1 x SDS at 55EC, stirring every 10 min. The product of digestion is then subjected to a double 
extraction with phenol-chloroform, two volumes of ethanol are added and the DNA is directly 
recovered with a Pasteur pipette having one end in the form of a hook. The DNA is dried on the edge 

2 0 of the tube and then .suspended in 500 ,1 of 2 mM Tris P H 7.5. The DNA is stored at 4°C for at least 
24 hours before being used for the cloning. 
Cloning of the DNA 

After precipitation, the DNA is quantified by measuring the optical density at 260 nm. 
Thirty ug of Chlamydia DNA are distributed into 10 tubes of 1.5 ml and diluted in 300 u. of water. 
25 Each of the tubes is subjected to 10 applications of ultrasound lasting for 0.5 sec in a somcator 
(unisonix XL2020). The contents of the 10 tubes are then grouped and concentrated by successrve 
extractions with butanol (Sigma B1888) in the following manner: two volumes of butanol are added to 
the dilute DNA mixture. After stirring, the whole is centrifuged for five minutes at 2500 rpm and the 
butanol is removed. This operation is repeated until the volume of the aqueous phase is less than 1 ml. 
3 0 The DNA is then precipitated in the presence of ethano. and of 0.5 M sodium acetate P H 5.4, and then 
centrifuged for thirty minutes at 15,000 rpm at cold temperature (4»C). The pellet is washed w«h 75% 
ethanol centrifuged for five minutes at 15,000 rpm and dried at room temperature. A tenth of the 
preparation is analysed on a 0.8% agarose gel Typically, the size of the DNA fragments thus prepared 
is between 200 and 8000 base pairs. 
3 5 To allow the cloning of the DNA obtained, the ends are repaired. The DNA is distributed 

* i„ an amount of lO^g/tube, in the following reaction medium: 100 ul final volume, 1 x buffer 
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(Biolabs 201L), 0.5 nl BSA 0.05 mg/ml, 0.1 mM dATP, 0.1 mM each of dGTP, dCTP or dTTP, 
60,000 IU T4 DNA polymerase. The reaction is incubated for thirty minutes at 16°C. The contents of 
each of the tubes are then grouped before carrying out an extraction with phenol-chloroform and then 
precipitating the aqueous phase as described above. After this step, the DNA thus prepared is 
5 phosphorylated. For that, the DNA is distributed into tubes in an amount of 10 jig per tube, and then in 
a final volume of 50 the reaction is prepared in the following manner: 1 mM ATP, 1 x kinase 
buffer, 10 IU T4 polynucleotide kinase (Biolabs 201L). The preparation is incubated for thirty minutes 
at 37°C. The contents of the tubes are combined and a phenol-chloroform extraction and then a 
precipitation are carried out in order to precipitate the DNA. The latter is then suspended in 1 |il of 

10 water and then the DNA fragments are separated according to their size on a 0.8% agarose gel (1 x 
TAE). The DNA is subjected to an electric field of 5 V/cm and then visualized onaUV table. The 
fragments whose size varies between 1200 and 2000 base pairs are selected by cutting out the gel. The 
gel fragment thus isolated is placed in a tube and then the DNA is purified with the Qiaex kit (20021 
Qiagen), according to the procedure provided by the manufacturer. 

15 Preparation of the vector 

14 ^g of the cloning vector pGEM-5Zf (Promega P2241) are diluted in a final volume of 
150 |il and are subjected to digestion with the restriction enzyme EcoRV 300 IU (Biolabs 195S) 
according to the protocol and with the reagents provided by the manufacturer. The whole is placed at 
37°C for 150 min and then distributed in the wells of a 0.8% agarose gel subjected to an electric field 

20 of 5 V/cm. The linearized vector is visualized on a UV table, isolated by cutting out the gel and then 
purified by the Qiaex kit (Qiagen 20021) according to the manufacturer's recommendations. The 
purification products are grouped in a tube, the volume is measured and then half the volume of 
phenol is added and the whole is vigorously stirred for 1 min. Half the volume of chloroform-isoamyl 
alcohol 24:1 is added and vigorously stirred for 1 min. The whole is centrifuged at 15,000 rpm for 

25 5 min at 4°C, the aqueous phase is recovered and transferred into a tube. The DNA is precipitated in 
the presence of 0.3 M sodium acetate, pH 5.4 and 3 volumes of ethanol and placed at -20°C for 1 hour. 
The DNA is then centrifuged at 15,000 rpm for 30 min at 4°C, the supernatant is removed while 
preserving the pellet, washed twice with 70% ethanol. After drying at room temperature, the DNA is 
suspended in 25 |il of water. 

3 0 Phosphorylation of the vector 

25 fil of the vector prepared in the preceding step are diluted in a final volume of 500 |xl 
of the following reaction mixture: 

After repair, the DNA is subjected to a phenol-chloroform extraction and a precipitation, 
the pellet is then taken up in 10 |il of water, the DNA is quantified by measuring the optical density at 

35 260 nm. The quantified DNA is ligated into the vector PGEm-5Zf(+) prepared by the restriction 



10 



PCT/1B98/01890 

WO 99/27105 

78 

enzyme EcoRV and dephosphorylated (see potion of the vector). The ligation is carried 
out under three conditions which vary in the ratio between the number of vector molecules and the 
number of insert molecule, t^.**^^*™******-^ 
the ligations which are, moreover, carried out under the following conditions: vector PGEm-5Zf( + ) 
5 25 ng cut DNA, ligation buffer in a final volume of 20 ,1 with T4 DNA ligase (Amersham E70042X); 
the whole is then placed in a refrigerator overnight and then a phenol-chloroform extraction and a 
precipitation are carried out in a conventional manner. The pellet is taken up in 5 0 of water. 
Transformati on of the bacteria 
Platin g of the bacteria 

Petri dishes containing LB Agar medium containing ampicillin (50ug/ml), Xgal 
(280ug/ml) [S-bromo^-chloro-indolyl-beta-D-galactopyranoside (Sigma B-4252)], IPTG 
( ,40 Ug/ml) [isopropyl-beta-D-thiogalactoside (Sigma 1-6758)] are used, 50 and 100 pi of bacteria are 
p,atedfor each of the ligations. The Petri dishes are placed upside down at 37°C for 15 to 16hoursm 
an oven. The number of "recombinant" positive clones is evaluated by counting the white colonies and 
15 the blue colonies which are thought to contain the vector alone. 

Pv a i,, a tinn of tb» 'Wnmhinant" positive clones 

Ninety-four white colonies and two blue colonies are collected with the aid of stenle 
cones and are deposited at the bottom of the wells of plates designed for carrying out the amplified 
techniques 30 ul of the following reaction mixture are added to each well: 1.7 mM MgCh, 0.2 mM 
20 each of dATP, dCTP, dGTP and dTTP, two synthetic oligonucleotides correspondmg to sequences 
flanking the cloning sue on either s.de and onenting the synthesis of the DNA in a convergent manner 
(0 5 m RP and PU primers, 1 U TAQ polymerase (GibcoBRL 1 8038-026)). 

The colonies thus prepared are subjected to a temperature of 94°C for 5 min and then to 
30 thermal cycles composed of the following steps: 94°C for 40 s, 50°C for 30 s, 72°C for 180 s. The 
25 reaction is then kept for 7 min at 72°C and then kept at 4°C. 

The amplification products are deposited on an agarose gel (0.8%), stained with ethuhum 
bromide, subjected to electrophoresis, and then analysed on an ultraviolet table. The presence of an 
amplification fragment having a size greater than 500 base pairs indicates the presence of an msert. 
The bacterial clones are then prepared so as to study the sequence of their msert. 

30 Seq uencing 

To sequence the inserts of the clones obtained as above, these were amplified by PCR on 
bacteria cultures carried out overmght using the primers for the vectors flanking the inserts. The 
sequence of the ends of these inserts (on average 500 bases on each side) was determmed by 
automated fluorescent sequencing on an ABI 377 sequencer, equipped with the ABI Prism DNA 
3 5 Sequencing Analysis software (version 2. 1 .2). 
ftpgiycic nf thp. sequences 
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The sequences obtained by sequencing in a high-yield line (Figure 1) are 
stored in a database; this part of the production is independent of any treatment of the sequences. The 
sequences are extracted from the database, avoiding all the regions of inadequate quality, that is to say 
the regions for which uncertainties are observed on the sequence at more than 95%. After extraction, 
5 the sequences are introduced into a processing line, the diagram of which is described in Figure 2. In a 
first path of this processing line, the sequences are assembled by the Gap4 software from R. Staden 
(Bonfield et al., 1995) (OS UNIX/SUN Solaris); the results obtained by this software are kept in the 
form of two files which will be used for a subsequent processing. The first of these files provides 
information on the sequence of each of the contigs obtained. The second file represents all the clones 
1 0 participating in the composition of all the contigs as well as their positions on the respective contigs. 

The second processing path uses a sequence assembler (TIGR-Asmg assembler 
UNIX/SUN Solaris); the results of this second processing path are kept in the form of a file in the 
TIGR-Asmg format which provides information on the relationship existing between the sequences 
selected for the assembly. This assembler is sometimes incapable of linking contigs whose ends 
1 5 overlap over several hundreds of base pairs. 

The results obtained from these two assemblers are compared with the aid of the BLAST 
program, each of the contigs derived from one assembly path being compared with the contigs derived 
from the other path. 

For the two processing paths, the strict assembly parameters are fixed (95% homology, 

20 30 superposition nucleotides). These parameters avoid 3 to 5% of the clones derived from eukaryotic 
cells being confused with sequences obtained from the clones derived from Chlamydia pneumoniae. 
The eukaryotic sequences are however preserved during the course of this project; the strategy 
introduced, which is described below, will be designed, inter alia, not to be impeded by these 
sequences derived from contaminating clones. 

25 The results of these two assemblers are processed in a software developed for this 

project. This software operates on a Windows NT platform and receives, as data, the results derived 
from the STADEN software and/or the results derived from the TIGR-Asmg assembler, the software, 
results, after processing of the data, in the determination of an assembly map which gives the 
proximity relationship and the orientation of the contigs in relation to one another (Figure 3a). Using 

3 0 this assembly map, the software determines all the primers necessary for finishing the project. This 
treatment, which will be detailed below, has the advantage of distinguishing the isolated sequences 
derived from the contaminations, by the DNA eukaryotic cells, of the small-sized sequences clearly 
integrated into the project by the relationships which they establish with contigs. In order to allow, 
without any risk of error, the arrangement and the orientation of the contigs in relation to one another, 

35 a statistical evaluation of the accuracy of the names (naming) "naming" of sequence is made from the 
results of "contigation". This evaluation makes it possible to give each of the clone plates, as well as 
each of the subsets of plates, a weight which is inversely proportional to probable error rate existing in 
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of the sequence is called the inner primer whereas the primer which is more distant from the end of 
the sequence is called the outer primer. The outer primers are used to explore the mutual relationship 
between the orphan ends of the different contigs. The presence of a single PCR product and the 
possibility of amplifying this product unambiguously using the inner primers evokes the probable 
5 relationship between the contigs on which the primers which allowed the amplification are situated. 
This relationship will be confirmed by sequencing and will allow the connection between the orphan 
ends of the consensus sequences. This strategy has made it possible to obtain a complete map of the 
Chlamydia pneumoniae chromosome and then to finish the project. 
Quality control 

1 0 All the bases not determined with certainty in the chromosomal sequence were noted and 

the density of uncertainties was measured on the entire chromosome. The regions with a high density 
of uncertainties were noted and the PCR primers spanning these regions were drawn and are 
represented in Table 4 of parent U.S. application serial No. 60/107078 filed November 4, 1998 and 
French application 97-14673 filed November 21, 1997 each of which is incorporated by reference 

15 herein in its entirety. 

The sequence of each of the PCR products was obtained with two operational primers 
different from the amplification primers. The sequences were obtained in both directions for all the 
PCRs(100% success). 

Data banks 

2 0 Local reorganizations of major public banks were used. The protein bank used consists of 

the nonredundant fusion of the Genpept bank (automated translation of GenBank, NCBI; Benson 
, etal., 1996). 

The entire BLAST software (public domain, Altschul etal., 1990) for searching for 
homologies between a sequence and protein or nucleic data banks was used. The significance levels 
25 used depend on the length and the complexity of the region tested as well as the size of the reference 
bank. They were adjusted and adapted to each analysis. 

The results of the search for homologies between a sequence according to the invention 
and protein or nucleic data banks are presented and summarized in Table 1 below. 

30 Table 1: List of coding chromosome regions and homologies between these regions and the sequence 
banks. 

Legend to Table 1: Open reading frames are identified with the GenMark software 
version 2.3A (GenePro), the template used is Chlamydia pneumoniae of order 4 on a length of 
196 nucleotides with a window of 12 nucleotides and a minimum signal of 0.5. The reading frames 

3 5 ORF2 to ORF 1 1 37 are numbered in order of appearance on the chromosome, starting with ORF2 

(ORF column). The positions of the beginning and of the end are then given in column 2 (position). 
When the position of the beginning is greater than the position of the end, this means that the region is 
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(4) A search for ORFs that determined all ORF lengths using start and stop positions in the 

same reading frames was performed. 

(5) All ORFs whose DNA length was less than 225 nucleotides were eliminated from the search. 

5 Surface Exposed Protein Search Criteria 

Potential cell surface vaccine targets are outer membrane proteins such as porins, 
lipoproteins, adhesions and other non-integral proteins. In Chlamydia psittaci, the major immunogens 
is a group of putative outer membrane proteins (POMPs) and no homologs have been found in 
Chlamydia pneumoniae and Chlamydia trachomatis by traditional analysis (Longbottom, D., Russell, 

10 M., Dunbar, S.M., Jones, G.E., and A.J. Herring. 1998. Molecular Cloning and Characterization of the 
Genes Coding for the Highly Immunogenic Cluster of 90-Kilodalton Envelope Proteins from 
Chlamydia psittaci Subtype That Causes Abortion in Sheep. Infect Immun 66:1317-1324.) Several 
putative outer membrane proteins have been identified in Chlamydia pneumoniae, all of which may 
represent vaccine candidates. The major outer membrane protein (MOMP) gene (ompl) has been 

15 found in various isolates of Chlamydia pneumoniae (Jantos, CA., Heck, S., Roggendorf, R., Sen- 
Gupta, M., and Hegemann, JH. 1997. Antigenic and molecular analyses of different chlamydia 
pneumoniae strains. J. Clin Microbiology 35(3):620-623.) Various criteria, as listed below, were used 
to identify putative surface exposed ORFs from the genomic DNA sequence of Chlamydia 
pneumoniae (French application 97-14673 filed 21 November 1997). Any ORF which met any one or 

2 0 more of the individual criteria were listed in this category. 

Protein homology searches were done using the Blastp 2.0 tool (Altschul, S.F., Madden, 
T.L., Schaffer, A.A., Zhang, J., Zhang, Z., Miller, W., and D.J. Lipman. 1997. Gapped BLAST and 
PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 25:3389- 
3402. ) An ORF product was labeled surface exposed if there was homology to a known, or 

2 5 hypothetical, or putative surface exposed protein with a P score better than e" 10 . 

Most, if not all, proteins that are localized to the membrane of bacteria, via a secretory 
pathway, contain a signal peptide. A software program, SignalP, analyzes the amino acid sequence of 
an ORF for such a signal peptide (Nielsen, H., Engelbrecht. J., Brunak, S., and G. von Heijne. 1997. 
Identification of prokaryotic and eukaryotic signal peptides and prediction of their cleavage sites. 

30 Protein Engineering 10:1-6.) The first 60 N-terminal amino acids of each ORF were analyzed by 
SignalP using the Gram-Negative software database. The output generates four separate values, 
maximum C, maximum Y, maximum S, and mean S. The S-score, or signal region, is the probability 
of the position belonging to the signal peptide. The C-score, or cleavage site, is the probability of the 
position being the first in the mature protein. The Y-score is the geometric average of the C-score and 

35 a smoothed derivative of the S-score. A conclusion of either a Yes or No is given next to each score. 
If all four conclusions are Yes and the C-terminal amino acid is either a phenylalanine (F) or a tyrosine 
(Y), the ORF product was labelled outer membrane (Struyve, M., Moons, M., and J. Tommassen. 



PCI71B98/01890 

WO 99/27105 

84 

1991 . Carboxy,erminal Phenylalanine is Essential for the Correct Assembly of a Bacterial 
OuterMembraneProtein.I.Mol.Biol.218:141-148.) 

The program called Psort, determines the localization of a protem based on Us stgnal 
sequence, recognition of transmembrane segments, and analysis of its amino acid composition (Nakar, 
5 K andMKanehisa.1991. Expert system for predicting protein localization sites in gram-negattve 
bacteria. Proteins 1 1:95-1 10.) An ORF product is considered to be an outer membrane protem rf the 
output data predicts the protein as outer membrane with a certainty va,ue of 0, or better and whose 
value is at least twice as large as the next predicted localized certainty value. 

Finally ORF products that were not predicted to be outer membrane or surface exposed, 
10 based on the above' cnteria, we. further analyzed. The b,astp output data for these ORFs were 
searched using various general and specific Keywords, suggestive of Known cell surface exposed 
proteins An ORF was labeled surface exposed if the keywords matched had a Blastp hrt, had a P 
score better than e». and that there was no better data mdicaung otherw.se. The fo.lowing is a hst of 
the searched keywords: 
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indicating otherwise. A list of ORFs in the Chlamydia pneumoniae genome encoding 
putative surface exposed proteins is set forth above in the specification. 

Identification of Putative Lipoproteins in the Genome of Chlamydia pneumoniae 
5 Lipoproteins are the most abundant post-translationally modified bacterial 

secretory proteins (Pugsley, A. P.. 1993. The complete general secretory pathway in Gram- 
negative bacteria. Microbiol. Rev. 57:50-108). The characteristic features of lipoproteins are 
a thiol-linked diacylglyceride and an amine-linked monoacyl group on the cysteine that 
becomes the amino-terminal residue after signal peptide cleavage by Signal Peptidase II. 

10 (Pugsley, A. P.. 1993. The complete general secretory pathway in Gram-negative bacteria. 
Microbiol. Rev. 57:50-108). The identification of putative lipoproteins from the genomic 
sequencing of Chlamydia pneumoniae was done by examining the deduced amino acid 
sequence of identified ORFs for the presence of a signal peptide with a Signal Peptidase II 
cleavage site analogous to the consensus sequence for prolipoprotein modification and 

15 processing reactions (Hayashi, S., and H. C. Wu. 1992. Identification and characterization of 
lipid-modified proteins in bacteria, p. 261-285. In N. M. Hooper and A. J. Turner (ed.) Lipid 
modification of proteins: A practical approach. Oxford University Press, New York; Sutcliffe, 
I. C. and R. R. B. Russell. 1995. Lipoproteins of Gram-positive bacteria. J. Bacteriol. 
177:1123-1128.). 

20 Chlamydia pneumoniae ORFs were initially screened for the most basic of lipoprotein 

characteristics, a cysteine in the first 30 amino acids of the deduced protein. ORFs with a 
standard start codon (ATG, GTG, or TTG) and having one or more of the following 
characteristics were selected for direct analysis of their first 30 amino acids: 

(a) Significant Signal P value (at least two out of the four values are Yes) 

25 

(b) PSORT value indicating membrane passage (IM-inner membrane, Peri-periplasm, or 

OM-outer membrane) 

(c) Identification of the word lipoprotein among the ORF blastp data set. 

30 (d) A Blastp value of <e" J0 with a putative lipoprotein from Chlamydia trachomatis 

(French applications 97-15041 filed 28 November 1997 and 97-16034 filed 17 
December 1997). 

The first 30 amino acids of each ORF in this set were analyzed for the characteristics 
commonly found in lipoprotein signal peptides (Pugsley, A. P.. 1993. The complete general secretory 
35 pathway in Gram-negative bacteria. Microbiol. Rev. 57:50-108; Hayashi, S., and H. C. Wu. 1992. 
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Identification and characterization of lipid- modified proteins in bacteria, p. 261-285. In N. 
M. Hooper and A. J. Turner (ed.) Lipid modification of proteins: A practical approach. Oxford 
University Press, New York; Sutcliffe, I. C. and R. R. B. Russell. 1995. Lipoproteins of Gram-positive 
bacteria. J. Bacterid. 177:1123-1128.) Putative lipoprotein signal peptides were required to have a 
5 cysteine between amino acid 10 and 30 and reach a minimum score of three based on the following 
criteria for lipoprotein signal peptides: 

(a) Identification of specific amino acids in specific positions around the cysteine which are part of 
the consensus Signal Peptidase II cleavage site (Hayashi, S., and H. C. Wu. 1992. 
Identification and characterization of lipid-modified proteins in bacteria, p. 261-285. In N. M. 
10 Hooper and A. J. Turner (ed.) Lipid modification of proteins: A practical approach. Oxford 

University Press, New York); Sutcliffe, I. C. and R. R. B. Russell. 1995. Lipoproteins of 
Gram-positive bacteria. J. Bacterid. 177:1123-1128). Since the identification of the cleavage 
site is the most important factor in identifying putative lipoproteins, each correctly positioned 
amino acid contributed toward reaching the minimum score of three, (b) A hydrophobic 
15 region rich in alanine and leucine prior to the cleavage site (Pugsley, A. P.. 1993. The 

complete general secretory pathway in Gram-negative bacteria. Microbiol. Rev. 57:50-108) 
contributed toward reaching the minimum score of three. 

(c) A short stretch of hydrophilic amino acids greater than or equal to 1 usually lysine or 
arginine following the N-terminal methionine (Pugsley, A. P.. 1993. The complete 
20 general secretory pathway in Gram-negative bacteria. Microbiol. Rev. 57:50-108) 

contributed toward reaching the minimum score of three. 
A list of ORFs in the Chlamydia pneumoniae genome encoding putative lipoproteins is set forth above 
in the specification. 

25 T .PS-Related ORFs of Ch lnmy/Ua pneumoniae 

Lipopolysaccharide (LPS) is an important major surface antigen of Chlamydia cells. 
Monoclonal antibodies (Mab) directed against LPS of Chlamydia pneumoniae have been identified 
that can neutralize the infectivity of Chlamydia pneumoniae both in vitro and in vivo (Peterson, E.M., 
de la Maza, L.M., Brade, L., Brade, H. 1998. Characterization of a Neutralizing Monoclonal 
30 Antibody Directed at the Lipopolysaccharide of Chlamydia pneumonia. Infect. Immun. Aug. 
66(8):3848-3855.) Chlamydial LPS is composed of lipid A and a core oligosaccharide portion and is 
phenotypically of the rough type (R-LPS) (Lukacova, M.. Baumann, M., Brade, L„ Mamat, U., 
Brade, H. 1994. Lipopolysaccharide Smooth-Rough Phase Variation in Bacteria of the Genus 
Chlamydia. Infect. Immun. June 62(6):2270-2276.) The lipid A component is composed of fatty acids 
35 which serve to anchor LPS in the outer membrane. The core component contains sugars and sugar 
derivatives such as a trisaccharide of 3-deoxy-D-manno-octulosonic acid (KDO) (Reeves, P.R., 
Hobbs, M.. Valvano, M.A., Skumik, M., Whitfield, C, Coplin, D., Kido, N., Klena, J., Maskell, D., 
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Raetz, C.R.H., Rick, P.D. 1996. Bacterial Polysaccharide Synthesis and Gene 
Nomenclature pp. 10071-10078, Elsevier Science Ltd.). The KDO gene product is a multifunctional 
glycosyltransferase and represents a shared epitope among the Chlamydia. For a review of LPS 
biosynthesis see, e.g., Schnaitman, C.A., Klena, J.D. 1993. Genetics of Lipopolysaccharide 
5 Biosynthesis in Enteric Bacteria. Microbiol. Rev. 57:655-682. 

A text search of the ORF blastp results identified several genes that are involved in 
Chlamydial LPS production with a P score better than e" 10 . The following key-terms were used in the 
text search: KDO, CPS (Capsular Polysaccharide Biosynthesis), capsule, LPS, rfa, rfb, rfc, rfe, rha, 
rhl, core, epimerase, isomerase, transferase, pyrophosphorylase, phosphatase, aldolase, heptose, 
10 manno, glucose, lpxB, fibronectin, fibrinogen, fucosyltransferase , lie, Igt, pgm, tolC, rol, ChoP, 
phosphorylcholine, waaF, PGL-Tbl. A list of ORFs in the Chlamydia pneumoniae genome encoding 
putative polypeptides involved in LPS biosynthesis is set forth above in the specification. 

Type III And Other Secreted Products 

15 Type III secretion enables gram-negative bacteria to secrete and inject pathogenicity 

proteins into the cytosol of eukaryotic host cells (Hueck, C. J., 1998. Type III Protein Secretion 
Systems in Bacterial Pathogens of Animals and Plants. In Microbiology and Molecular Biology 
Reviews. 62:379-433.) These secreted factors often resemble eukaryotic signal transduction factors, 
thus enabling the bacterium to redirect host cell functions (Lee, C.A., 1997. Type in secretion 

20 systems: machines to deliver bacterial proteins into eukaryotic cells? Trends Microbiol. 5:148-156.) 
In an attempt to corrupt normal cellular functions, Chlamydial pathogenicity factors injected into the 
host cytosol will nonetheless, as cytoplasmic constituents be processed and presented in the context of 
the Major Histocompatibility Complex (MHC class I). As such, these pathogenicity proteins represent 
MHC class I antigens and will play an important role in cellular immunity. Also included in this set 

25 are secreted non-type III products that may play a role as vaccine components. 

A text search of the ORF blastp results identified genes that are involved in Chlamydia 
pneumoniae protein secretion with a P score better than e" 10 . The following key-terms were used in the 
text search in an effort to identify surface localized or secreted products: Yop, Lcr, Ypk, Exo, Per, 
Pop, Ipa, Vir, Ssp, Spt, Esp, Tir, Hrp, Mxi, hemolysin, toxin, IgA protease, cytolysin, tox, hap, 

30 secreted and Mip. 

Chlamydia pneumoniae ORFs that did not meet the above keyword search criteria, but 
have homologs in Chlamydia trachomatis that do meet the search criteria are included herein. The 
Chlamydia trachomatis genome (French patent applications FR97- 15041, filed 28 November 1997 
and 97-16034 filed 17 December 1997) was analyzed using the above search criteria and a number of 

35 ORFs were identified. These Chlamydia trachomatis ORFs were tested against the Chlamydia 
pneumoniae genome using Blastp. Any Chlamydia pneumoniae ORF with a Blastp P value < e* 10 
against a Chlamydia trachomatis homolog, identified using the above search criteria, was included. A 
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d) Pertactin, a 69 kDa outer membrane protein of Bordetella pertussis, has been shown to 
promote attachment of Chinese hamster ovary cells (CHO). This attachment is mediated by 
recognition of RGD sequence in pertactin by integrins on CHO cells and can be inhibited by 
5 synthetic RGD containing peptide homologous to the one present in pertactin (Leininger, E., 

Roberts, M., Kenimer, J. G., Charles, I. G., Fairweather, N., Novotny, P., and Brennan, M. J. 
1991. Pertactin, an Arg-Gly-Asp containing Bordetella pertussis surface protein that 
promotes adherence of mammalian cells Proc. Natl. Acad. Sci. USA, 88:345-349). 

10 e) The RGD sequence is highly conserved in the VP1 protein of foot and mouth 

disease virus (FMDV). Attachment of FMDV to baby hamster kidney cells (BHK) 
has been shown to be mediated by VP1 protein via the RGD sequence. Antibodies 
against the RGD sequence of VP1 blocked attachment of virus to BHK cells (Fox, G., 
Parry, N. R., Barnett, P. V., McGinn, B., Rowland, D. J., and Brown, F. 1989. The 
15 Cell Attachment Site on Foot-and-Mouth Disease Virus Includes the Amino Acid 

Sequence RGD (Arginine-Glycine-Aspartic Acid) J. Gen. Virol., 70:625-637). 
It has been demonstrated that bacterial adherence can be based on interaction of a bacterial 
adhesin RGD sequence with an integrin and that bacterial adhesins can have multiple binding site 
characteristic of eukaryotic extracellular matrix proteins, RGD recognition is one of the important 
20 mechanisms used by microbes to gain entry into eukaryotic cells. 

The complete deduced protein sequence of the Chlamydia pneumoniae genome was searched 
for the presence of RGD sequence. There were a total of 54 ORFs that had one or more RGD 
sequences. Not all RGD containing proteins mediate cell attachment. It has been shown that RGD 
containing peptides that have proline immediately following the RGD sequence are inactive in cell 
25 attachment assays (Pierschbacher & Ruoslahti. 1987. Influence of stereochemistry of the sequence 
Arg-Gly-Asp-Xaa on binding specificity in cell adhesion. J. Biol. Chem. 262:17294-98). ORFs that 
had RGD, with proline as the amino acid following the RGD sequence were excluded from the list. 
Also, RGD sequence may not be available at the surface of the protein or may be present in a context 
that is not compatible with integrin binding. Since not all RGD- containing proteins are involved in 
30 cell attachment, several other criteria were used to refine the list of RGD- containing proteins. A list 
of ORFs in the Chlamydia pneumoniae genome encoding polypeptides with RGD recognition 
sequence(s) is in the specification. 

Non-Chlamvdia trachomatis ORFs 
35 Chlamydia pneumoniae ORFs were compared to the ORFs in the Chlamydia trachomatis 

genome (French patent applications FR97-15041, filed 28 November 1997 and 97-16034 filed 17 
December 1997) using Blastp. Any Chlamydia pneumoniae ORF with a Blastp P value worse than e* 
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, „. ^ against OM* — ORFs are indudcd in litis section. A list °* 



above in the specification 

Qdj Wall An'*™ Surface ORFs 



_ Prll Wnll An**™ smtace u^rs 

' u ^ th^ rpll wall of Gram-positive bactena via the 

Man, surface P*- are anc^ to the - » J ^ 

conned LPXTG motif (Schneewind, 0, Fowler, A, and Ml. KJ. .995 

Wall A-chor of Surface Prereto in S»p¥<««-« Scene 268:103-106). A scan 

ORFs was done using the ^ LPXTO. A lis. of ORFs ,„ .He ~ 



P— were deposited wire to Areencan Type C*re 
Collection (ATCQ, Rockville. Maryhmd. on Novereber 19, ,99g and assigned One access,™ 

It fi. order t„eoabrerecou„y of specific fragm sn,s o f *e c h _ one can ren ^ 
« — whose amplification products can ,fi=n be fenced and/oc Coned ,n,o an, 
suitable vector, acting .» anredand procedures known re .hose skilled in theurf. 

h addition. , sample of Ore* pools of clonea eovefing chromosomal regrens of .meres. 
weK d^Led with re. Areerfcnn Type Clrere Colleofion fATCC,, — 
November 19 199g and assigned .he indicated access,., number - . Each pool 
" sefiea o, Cones. When uken .ogother. Ore One. ^ols in ,he anreple cove. . ponren 
Illreosoree, wire . rednndare, of alighdy reore .han .wo. The - nnrebe, of Cones re .he 

sample is 196. 

The Cones cover die following three regions of inlerest: 
(i) position 30,000 .0 40,000 of SEQ ID No. 1 , referred to as region A; 
(ii> position 501 ,500 to 557,000 of SEQ ID No. 1. referred lo as region B; and 
(iii) p„ s iaon 8 15,000» 8 30,000ofSEQIDNo. 1. refe.red.oas regionC. 

Table 4 fiats gronps of oligonucleotides to be nsed re arephfy each of ORFs 2-129. 

Zi -a SEQ .D No, 1292 ,0 643.. For e«h ORE, Ore following is firfedt one forwar 
positioned 23300 bp upstream of the beginning of the ORE; one forward pnmer positioned ^ ^ 

Id one revere, prinrer » hP downstream „ the end of ORE, which is 2«3 bp - * 

end site of the ORE on the complementary stirred. The conespondtng SEQ ID No, for the 
pleT ace listed in Table 4, wltere Fp is the proaimn, forward primes-. Ed is .be dis.nl 
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primer; Bp is the proximal reverse primer; and Bd is the distal reverse primer. The positions 
of the 5' ends of each of these primers on the nucleotide sequence of SEQ ID No. 1 are shown in 
Table 5. 

Table 6 lists oligonucleotides (SEQ ID Nos. 6452-6843) to be used to amplify the inserts 
5 of each of the 196 clones present in the pooled sample according to standard procedures well 
known to those of skill in the art. These primers can also be utilized to amplify the chromosomal 
region corresponding to the region A, B or C within which the particular insert lies. Their 
positions are indicated in Table 7. 

The present invention is not to be limited in scope by the specific embodiments described 
10 herein, which are intended as single illustrations of individual aspects of the invention, and 
functionally equivalent methods and components are within the scope of the invention. Indeed, 
various modifications of the invention, in addition to those shown and described herein will 
become apparent to those skilled in the art from the foregoing description and accompanying 
drawings. Such modifications are intended to fall within the scope of the appended claims. 
15 All publications and patent applications mentioned in this specification are herein 

incorporated by reference to the same extent as if each individual publication or patent application 
was specifically and individually indicated to be incorporated by reference. 
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GRF Kos 


begin 


end \ 


jotential start 
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3393 
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155135 


1 C il "> AO 

154308 


155135 


130 


155141 


155467 


1 C C 1 /II 

155141 


131 


155703 


156779 


155703 


132 


1 5o/4o 


15 /635 


1 C./CI/IO 

lDO/^5 


133 


157653 


158996 


157653 


134 


159363 


159986 


159363 


135 


159880 


160446 


159880 


136 


160477 


160839 


160477 
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ORF Nos j 


begin 


: ! end 


potential start 


137 


160898 


161539 


160898 


138 


161527 


162153 


161527 


139 


162144 


162443 


162144 


~" 140 


162437 


164098 


162437 


141 


165451 


164228 


165451 


142 


166349 


165411 


166349 


143 


166949 


168442 


166949 


144 


169416 


171029 


169416 


145 


170857 


171459 


170857 


146 


172652 


173428 


172652 


147 


174626 


173439 


174626 


148 


174816 


175613 


174816 


149 


175598 


175954 


175598 


150 


175958 


176935 


175958 


151 


177708 


176938 


177708 


152 


177128 


177376 


177128 


153 


179472 


177841 


179472 


154 


| 179822 


179517 


179822 


155 


181793 


179943 


181793 


156 


182628 


181876 


182628 


157 


184420 


183074 


184420 


15? 


J 184988 184467 


184988 


155 


>T 18548] 


I 185112 


185483 


16( 


) 18590: 


> 185482 


( 185902 


16 


1 18617' 


t 18583< 


) 186174 


16 


2 18772( 


3 18658' 


7 187720 


16 


3 18831 


8 19093 


3 188318 


16 


4 19109 


0 19163 


5 191090 


16 


5 19154 


7 19274 


3 191547 


16 


,6 19296 


9 19346 




If 


17 19404 


A 19361 


0 194044 


It 


>8 1941S 


>6 1958C 


19 194196 


U 


59 19605 


58 1980' 


13 196088 


r 


70 1981: 


)2 1994! 


>4 198132 



WO 99/27105 



PCT/IB98/0I890 



141 



QRFNos 


begin 


ehd: : ;'t ; 


potential start 


171 


199351 


202818 


199351 


172 


204552 


202999 


204552 


173 


205648 


204692 


205639 


174 


205807 


207327 


205807 


175 


207182 


207775 


207182 


176 


207779 


208267 


207779 


177 


208267 


209577 


208267 


178 


211807 


211271 


211807 


179 


212188 


211844 


212188 


180 


214079 


212448 


214079 


181 


214907 


214083 


214907 


182 


216154 


215429 


216154 


183 


216115 


216678 


216115 


184 


216728 


217282 


216728 


185 


217267 


217866 


217267 


186 


218593 


218261 


218590 


187 


219821 


218994 


219821 


188 


221382 


220309 


221382 


189 


222719 


221433 


222719 


190 


223521 


222724 


223521 


191 


224499 


225008 


224499 


192 


225140 


225559 


225140 


193 


225555 


226802 


225555 


194 


227800 


226892 


227743 


195 


228335 


228072 


228335 


196 


229251 


228643 


229251 


197 


230983 


229622 


230983 


198 


231483 


230983 


231483 


199 


232063 


231509 


232063 


200 


232739 


232053 


232739 


201 


233166 


234356 


233166 


202 


233518 


233165 


233518 


203 


234536 


, 235186 


» 234536 


204 


23537S 


i 23668S 


► 235379 
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begin 


end J 


potential start : 


205 


236680 


237618 


236689 


206 


237521 


238345 


237521 


207 


238281 


238973 


238281 


208 


238871 


240115 


238871 


209 


240191 


241564 


240191 


210 


242281 


241604 


242281 


211 


242933 


242274 


242933 


~ 212 


243416 


242976 


243416 


213 


243500 


244531 


243500 


214 


244480 


246021 


244480 


215 


246330 


247811 


246330 


216 


247831 


249174 


247870 


217 


249437 


251038 


249455 


218 


251325 


252212 


251325 


219 


253156 


254007 


253156 


220 


253974 


254852 


253974 


221 


255258 


256094 


255258 


222 


256640 


257455 


256640 


223 


257502 


258239 


| 257502 


224 


257869 


257501 


j 257869 


225 


259248 


260897 


259248 


22e 


, 262753 


26178? 


262753 


22' 


1 26305S 


> 26275' 


l 263059 


221 


3 26437; 


> 26318: 


) 264375 


22 


9 26598 


5 26474' 


7 265985 


23 


0 26663 


7 26605 


9 266637 


23 


1 26733 


8 26653 


8 267338 


23 


2 26792 


2 26747 


3 267922 


23 


3 26964 


7 27077 


1 269647 


22 


14 2727' 


n 213V 


(5 272777 


2: 


$5 2732f 


>3 27361 


56 273253 


2' 


36 2737( 


)5 2739' 


77 273705 


2 


37 2760 


16 2757 


17 276016 


2 


38 2764 


39 276020 276418 
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; ; y oRF Nos ; " 


begin 

.;.y.;\;. : : ::••>•••••; 


end 


potential start 


239 


276792 


277253 


0*7i£ *7AO 

276792 


O >l A 

240 


277318 


277599 


0*7*7*i 1 C 


241 


278578 


0*7*7 0*7*7 


2 /o5 /o 


O A O 

242 


2 /92!>o 


2 /o554 


OOAOC Q 

2 /92jo 


0/1*1 

24 J 


ocA/ior 

ZoU4J> 


Z Iyj55 


ZoU4JD 


244 


2ol 54/ 


ZoUo4y 


AO 1 C/f *7 

2ol j4 / 


245 


O O 1 /T A^ 

281696 


OOOOOC 

282325 


281717 


O A £ 

246 


e> OO A C f\ 

282459 


/> o vl A^" A 

284069 


O OO A C C\ 

282459 


247 


284056 


284517 


284056 


248 


284606 


285775 


OO A£t\£ 

284606 


249 


285592 


O O AO*7 

285987 


AOf f AO 

285592 


250 


O O Z' 1 OA 

286179 


286976 


O 1 OA 

286179 


251 


O OTf OO 

287583 


287002 


O O *~tc o o 

287583 


252 


287951 


28745 1 


287951 


253 


288499 


288816 


O O O A C\C\ 

288499 


254 


289674 


o o o c t\e 

288505 


289674 


255 


O O 001 A 

288839 


o oni i o 

289213 


288839 


256 


289970 


290254 


O OAfnA 

289970 


257 


OA 1 AO 1 

291931 


292803 


OA 1 AO 1 

291931 


258 


293258 


292755 


OAOOCO 

293258 


259 


293718 


293272 


293718 


260 


O A A f O A 

294630 


293953 


O A A Z"0 A 

294630 


261 


296153 


294636 


296153 


262 


294817 


295068 


O A A O 1 O 

294817 


263 


296354 


O AO O/A 

297862 


296354 


264 


298415 


297879 


298415 


265 


298777 


298253 


298777 


266 


299572 


298781 


o/\/\^ IO 

299572 


267 


300487 


299633 


300487 


i 268 




JUO/02 


OA i c/CO 


269 


302440 


301571 


302440 


270 


302838 


302437 


302838 


271 


303335 


302745 


303335 


272 


304394 


303852 


304394 
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ORFNos , | 


begin 


end 


potential start 


273 


304606 


305223 


304606] 


274 


305394 


306236 


3053941 


275 


306501 


307439 


306501 


276 


308033 


307458 


308033 


277 


308924 


308037 


308924 


278 


309485 


310180 


309485 


279 


310426 


311214, 


310426 


280 


311597 


311253 


311504 


281 


312772 


311780 


312772 


282 


313425 


312772 


313425 


283[ 


313646 


313377 


313646 


284 


313937 


314665 


313937 


285 


315576 


314755 


3155761 


286f 


316157 


315531 


316157 


287 


318657 


316156 


3186571 


288 


321042 


318676 


321042 


289 


321445 


321098 


j 321445 


290 


322309 


321710 


| 3223091 
I 323181 


291 


P 323190 


322366 




292 


323843 


323181 


j 323843 


293 


324878 


32385( 


J 324878 


294 


\ 32534C 


l 3264K 


) 325340 


29f 


i 326431 


( 32783) 


S 326433 


29< 


3 32846! 


5 32783 


9 328465 


29' 


7 329361 


3 32885 


7 329360 


29 


8 33090 


7 32935 


7 330907 


29 


9 33245 


5 33095 


6 332455 


30 


0 33453 


6 3323S 


»5 334536 


30 


1 3360S 


»1 3348" 


n\ 336091 


3C 


>2 3361C 


)3 3373( 




3( 


)3 338i: 


19 3388. 


30 338129 


3< 


)4 3389( 


55 3 3 95 


01 338965 


3 


05 3395 


08 3401 


43 339508 


3 


06 3402 


47 3429 


67 340247 
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ORFNos 


begin 


• end 


potential start ; 


307 


343385 


343810 


343385 


308 


344171 


343935 


344171 


309 


345082 


O A A ^ 01 f\ 

344330 


345073 


310 


346005 


345082 


346005 


311 


346784 


346437 


346784 


312 


347029 


346715 


347029 


313 


347034 


347723 


347034 


314 


348075 


350459 


348075 


315 


350598 


351071 


350598 


316 


351075 


352175 


351096 


317 


353291 


352230 


353267 


318 


353442 


354467 


353442 


319 


354451 


354933 


354451 


320 


355000 


355449 


355000 


321 


355448 


356743 


355448 


322 


355953 


355642 


355953 


323 


359310 


356827 


359310 


324 


359120 


359377 


359120 


325 


359525 


359908 


359525 


326 


361290 


359947 


361290 


327 


363785 


361362 


363746 


j 328 


364496 


363888 


364496 


329 


364832 


365290 


364832 


330 


365304 


365669 


365304 


331 


366599 


365667 


366599 


332 


367291 


369030 


367291 


333 


369134 


369808 


369134 


334 


369917 


370438 


369917 


335 


370365 


372647 


370365 


336 


372557 


373066 


372DD7 


337 


373020 


373442 


373020 


338 


373467 


374195 


373467 


339 


374176 


375099 


374176 


34C 


) 375676 


375083 


375676 
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, ORFNos 


begin 


end, ' 


potential start 


341 


376173 


375634 


376173 


342 


376564 


377643 


376564 


343 


377956 


379773 


377956 


344 


379781 


380425 


379805 


345 


380281 


381000 


380281 


346 


381008 


381460 


381008 


347 


381460 


383037 


381460 


348 


383257 


383523 


383257 


349 


383553 


385304 


383553 


350 


385397 


386458 


385400 


351 


387242 


386514 


387242 


352 


388764 


387013 


388764 


353 


390120 


390932 


390120 


354 


390919 


391818 


390961 


355 


392379 


391885 


392379 


I" 356 


392582 


392986 


392582 


357 


392776 


393684 


392776 


f - 358 


394151 


394804 


394151 


359 


394928 


395308 


394928 


360 


395259 


395990 


395259 


\ 361 


397815 


395953 


397815 


362 


; 39885C 


) 397831 


398850 


36: 


\ 40008! 


> 39909S 


) 400085 


I 36- 


\ 40124 


5 40007: 


) 401236 


1 36 


5 40147 


4 40113 


6 401474 


36 


6 40219 


9 40142 


3 402199 


36 


7 40319 


3 40218 


6 403166 


1 36 


8 40365 


0 40416 


5 403650 


3( 


,9 4043^ 


13 40591 


4 404343 


y 


10 4059! 


$4 4073: 




1 3 


71 4077 


12 4088< 


)6 407712 


r™ 3 


72 4104 


39 4090 


75 410439 


i 3 


73 4118 


26 4109 


54 411826 


3 


74 4124 


82 4143 


02 412482 
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ORFHos 


begin 


end 


potential start 


375 


415402 


AAA A f\1 

414407 


41S4UZ 


376 


415848 


a 1 cin 

415237 


41 jo4o 


377 


417131 


415866 


41 / 1 J 1 


378 


A 1 T"* C O 

417258 


417566 


41 /ZJO 


379 


418326 


41 /4j4 


41o3Z0 


380 


420057 


41o4zo 


4ZUUD / 


381 


420448 


420720 


4zU445 


382 


420980 


421552 


/lOAAQA 


383 


421556 


422029 


A*> 1 C 

4215j6 


384 


422461 


422925 


42246 1 


385 


/|/>1 c 

423562 


4z43zU 


4Z3DOZ 


386 


424250 


424591 


42425U 


387 


Af\ 4 01 f\ 

424830 


426047 


/|A a Ol A 

424830 


388 


426240 


427397 


426240 


389 


428841 


430703 


428841 


390 


430694 


431446 


430694 


391 


431597 


432100 


/tit rm 

431597 


392 


432165 


432779 


432165 


393 


433272 


432832 


433272 


394 


433925 


433227 


/iT) ah 

433922 


395 


436678 


433934 


436678 


396 


437176 


438357 


437176 


397 


440317 


/i i o r 1 1 o 

438518 


A A AO 1 *7 

440317 


398 


440001 


A A A*5 /I C 

440345 


44U0U1 


399 


441233 


A Af\C 1 *7 

440517 


441233 


400 


440719 


441012 


a At\n i a 

440719 


401 


442192 


441230 


A A*\ 1 ftl 

442192 


402 


442888 


442343 


442888 


403 


442371 


442961 


442371 


A A/ 

404 


A A O C "70 

4435/0 


44JUUJ 


> 44jJ/o 


40f 


i 44450C 


I 44352f 


i 444500 


40( 


5 444842 


! 44452? 


444842 


40' 


7 44500$ 


) 44474: 


5 445009 


405 


4457 U 


44518: 


> 445718 
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ORFNos 


begin . : . 




potential start 


409 


445807 


447804 


445807 


410 


448738 


447803 


448738 


411 


449628 


448618 


449628 


412 


450298 


450867 


450298 


413 


450713 


451207 


450713 


r 414 


451211 


452452 


451211 


415 


452448 


453659 


452448 


416 


454843 


453725 


454843 


417 


455608 


454865 


455608 


418 


456243 


457007 


456243 


419 


457016 


457708 


4570161 


420 


458368 


457979 


458368 




459496 


458372 


459496] 


422 


459493 


460194 


459493 


423 


461446 


460355 


461446 


424 


462298 


461450 


462298 


425 


462444 


463349 


462444 


426 


464241 


463342 


464241 


427 


464574 


465065 


464574 


428 


465129 


465611 


465129 


429 


465571 


466317 


465571 


43C 


) 466317 


467093 


466317 


431 


1 46699S 


) 467502 


t 466999 


43: 


I 469691 


1 46771! 


5 469691 


43 


J 47069 


1 469661 


) 470691 


43 


4 47201 


0 47070 


9 472010 


i 43 


5 47154 


5 47179 


9 471545 


43 


6 47235 


9 47204 


5 472359 


43 


7 47352 


3 47273 


2 473523 


43 


8 47488 


9 47344 


i AH VtQQO 


42 


19 47732 


!3 4753* 


i5 477323 


* 


10 4784< 


)6 4775< 


)7 478496 


4- 


41 4787: 


n mi- 


73 478722 


4 


42 4792 


ll 47971 


05 479277 
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OKFNos 


begin . 


■end 

.- 


^potential start 


443 


480050 


481450 


480050 


444 


481469 


482053 


481469 


445 


482600 


482025 


482600 


446 


482654 


484204 


482654 


447 


484211 


485170 


A O A ^ 1 1 

484211 


448 


485170 


485838 


A O C 1 *7A 

485170 


449 


485813 


486580 


485813 


450 


486976 


486638 


486976 


451 


489071 


487764 


489071 


452 


489341 


489090 


489341 


453 


489958 


489152 


489958 


454 


490549 


489962 


490549 


455 


491163 


490522 


491163 


456 


491396 


491112 


491396 


457 


492121 


491390 


492121 


458 


492304 


494838 


492304 


459 


495943 


494822 


495943 


460 


496011 


496565 


496170 


461 


496569 


497228 


496569 


462 


497358 


497834 


497358 


463 


497770 


498327 


497770 


464 


499209 


499589 


499209 


465 


499520 


499792 


499520 


466 


500774 


504169 


500774 


467 


504139 


504600 


504139 


468 


504865 


506877 


504865 


469 


50679C 


507671 


506790 


47C 


> 507718 


510501 


507718 


471 


508325 


507912 


\ 508325 


47^ 


! 51066C 


t C 1 1 A At 

I 51344C 


1 MUooU 


47: 


J 51496f 


i 51378' 


J 514920 


47' 


\ 51734' 


1 51541< 


) 517347 


47; 


> 51705* 


I 51736! 


} 517058 


! 47< 


5 517795 


I 51727' 


7 517798 
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ORFNos ; 


begin 


end 


potential start 


477 


518200 


517847 


518200 


478 


518300 


521146 


518363 


479 


521392 


522948 


521407 


480 


523244 


524809 


523322 


481 


524379 


524125 


524379 


482 


524649 


526238 


524649 


483 


526265 


527104 


526268 


484 


526947 


526702 


526947 


485 


526975 


528450 


526975 


486 


528408 


529199 


528408 


487 


530612 


529542 


530612 


488 


531656 


530616 


531656 


489 


533974 


532067 


533974 


490 


536432 


534324 


536432 


491 


537150 


536707 


537150 


492 


5379281 


537080 


537928 


493 


538438 


537932 


538438 


494 


538737 


538333 


538737 


495 


539594 


j 539127 


539594 


496 


541215 


t" 539590 


541215 


497 


542571 


T~ 541282 


542571 


49? 


i 543011 


i| 54245" 


j 543014 


49< 


) 543365 


) 54296: 


I 543369 


50< 


) 54380' 


) 54662" 


8 543815 


50 


1 54661 


9 54952 


5 546619 


[ 50 


2 54729 


3 54699 


4 547293 


50 


3 54969 


9 55052 


3 549699 


5C 


>4 55049 


0 55155 


1 550490 


5( 


)5 5514^ 


18 55262 


>3 551448 


5( 


)6 5526? 


>2 5551 




51 


37 5550: 


29 5554' 


93 555029 


5 


08 5580 


06 5556 


73 558006 


5 


09 5596 


94 5581 


62 559694 


5 


10 5582 


08 5585 


73 558208 



WO 99/27105 
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begin. 


end 1 


potential start 


511 


561692 


559899 


561692 


512 


561412 


561708 


561412 


513 


563942 


561777 


563942 


514 


564969 


563950 


564969 


515 


566204 


564936 


can i ao 

566198 


516 


567717 


566302 


567717 


517 


568526 


567708 


568526 


518 


569467 


568742 


569467 


519 


571065 


569431 


571065 


520 


571828 


571118 


571783 


521 


572202 


573308 


572202 


522 


573146 


575056 


573146 


523 


575023 


575916 


575023 


524 


577891 


576497 


577891 


525 


578914 


578204 


578914 


526 


579924 


578857 


579924 


527 


580187 


579858 


580187 


528 


580017 


580406 


580017 


529 


581086 


580187 


581086 


530 


581367 


581828 


581367 


531 


581678 


582367 


581678 


532 


582361 


583428 


582361 


533 


584690 


583431 


584690 


534 


585237 


584950 


585237 


535 


585626 


586888 


585626 


536 


586846 


587907 


586888 


537 


589049 


588180 


589049 


538 


590500 


589301 


590455 


539 


590755 


592458 


590755 


540 


592526 


> 592903 


592526 


541 


592836 


» 59374"! 


' 592836 


542 


593741 


' 59429* 


593747 


542 


\ 594331 


59594' 


' 594331 


54^ 


\ 595905 


i 59630< 


) 595905 



PCT/IB98/01890 
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ORF Nos 


begia 


end 


potential start 


545 


596514 


597215 


596514 


546 


597184 


597957 


597184 


547 


597755 


598612 


597755 


548 


598602 


599204 


598602 


549 


599373 


599939 


599373 


550 


600903 


602072 


600903 


551 


602240 


602587 


602240 


552 


602637 


603272 


602637 


553 


603142 


604512 


603142 


554 


604627 


605853 


604627 


555 


605790 


606620 


605790 


556 


606571 


607281 


606571 


557 


609004 


607355 


609004 


558 


610906 


609932 


610906 


559 


611786 


611004 


611786 


560 


612333 


611746 


612333 


561 


613897 


612341 


613897 


562 


615179 


616279 


615179 


563 


616610 


617383 


616610 


564 


618796 


617810 


1 618796 


565 


620004 


618826 


1 620004 


566 


61964S 


i 619918 


j 619649 


56-; 


r 621 26f 


i 620021 


j 621265 


561 


I 622351 


) 621265 


; 622359 


5« 


) 62342( 


) 62256( 


) 623420 


571 


3 62429 


7 62333; 


> 624297 


57 


1 62477 


3 62417' 


4 624773 


57 


2 62502 


9 62548 


4 625029 


57 


3 62548 


8 62588 


3 625488 


57 


4 62589 


»2 62639 




51 


'5 62644 


14 62779 


0 626444 


5' 


16 62791 


[2 6286C 


)1 627930 


5" 


11 6287' 


74 6296S 


>7 628774 


5' 


78J 6296i 


60 6316: 


\9 629660 
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ORF Nos 


begin : 


end 


potential start 


579 


63 1 725 


^ C C 1 

633551 


1 T^C 

631725 


C O A 

580 


coa 

633520 


636957 


ZTO *> CIA 

633520 


581 


637232 


638098 


63 1 151 


582 


64U648 


639593 


640648 


583 


64U9/9 


640 /2o 


o40y fy 


COyl 

584 


a a 1 ion 
641327 


64 100/ 


CA 1 11*7 

64132/ 


CO c 

585 


641687 


^v1T> 0"> 

642283 


HA 1 /COT 

641687 


586 


643023 


Z" Ol£ 

642286 


643023 


587 


643330 


S~ A") A? /T 

643076 


ZT A 1 *> O A 

643330 


588 


643704 


643351 


643704 


589 


645628 


643676 


645628 


590 


645783 


Z* /I C O O 

645538 


645756 


591 


646269 


645793 


646269 


592 


646751 


646314 


646751 


593 


647848 


647045 


647848 


594 


648393 


650336 


ZTvl Ol A*> 

648393 


595 


651016 


650420 


s~ c \ AA*7 

651007 


596 


652956 


s- c 1 O OA 

651289 


^ one n 

652956 


597 


653395 


653126 


z; ct> Ac 

653395 


598 


655740 


654193 


655740 


599 


656508 


655966 


S r £ C AO 

656508 


600 


658140 


657022 


658140 


601 


660216 


658525 


ztzta'"* i z; 

6602 1 6 


602 


663238 


660248 


663238 


603 


£ /l A A 1 

664461 


663157 


C £ A A C"» 

664452 


604 


665735 


664635 


z*z* ni c 

665735 


605 


666212 


666994 


666212 


606 


666998 


667921 


666998 


607 


667909 


668568 


667909 


oUo 


ooojOz 


OOV2U3 




609 


669154 


670893 


669175 


610 


672226 


670853 


672226 


611 


671137 


671424 


671137 


612 


672453 


673001 


672453 



PCT/IB98/01890 

WO 99/27105 

154 



ORFNos 


begir. 






613 


673072 


674721 


✓""■J^ ATI 

673072 


614 


674549 


674262 


674549 


615 


675518 


674796 


675518 


616 


676083 


675499 


676083 


~ 617 


676630 


676067 


676630 


618 


677016 


676600 


677016 


619 


677647 


677015 


677647 


620 


677990 


678259 


677990 


621 


679444 


680097 


679444 


622 


680097 


680897 


680097 


623 


681637 


680849 


681637 


624 


681409 


682281 


681409 


625 


682453 


682821 


682453 


626 


682763 


683902 


682763 


627 


684616 


683969 


684616 


628 


685169 


684534 


685169 


629 


685986 


685117 


685986 


630 


686278 


687288 


686278 


631 


687483 


688151 


j 687483 


632 


688740 


689501 


688740 


633 


690242 


689622 


690242 


634 


\ 690470 


691126 


J 690470 


63f 


i 69260C 


1 69149' 


n 692600 


63( 


> 69267-: 


\ 69506- 


\\ 6926741 


63' 


7 69504* 


) 69603: 


l\ 695064 


63 


8 69796- 


1 69658 


5 697964 


63 


9 69980 


3 69827 


4 699803 


64 


0 70192 


6 69978 


8 701926 


64 


1 70319 


6 70256 


7 703196^ 


64 


\2 70422 


1 70321 






13 70424 


\0 10521 


19\ 704240 1 


6- 


14 7060' 


ro 7053( 


)0 706070 


6' 


*5 7068' 


11 7062. 


54 706838 


6 


46 7075' 


)6| 7068 


11 707596 
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CXRF Nos 


begin 


end 


potential start 


647 


nc\Q cue 
708666 


707677 


708666 


648 


t a a*7 no 

709793 


709119 


T AAT AO 

709793 


649 


*7 1 1 Ol 

71 1523 


710132 


T 1 1 00 

71 1523 


650 


T 1 HO H 

712236 


T 1 1 CO/O 

71 1523 


712236 


651 


H 1 All A 


712125 


*7 1 /I TO /t 

714734 


^ co 

652 


715759 


*7 1 A nc 1 

714761 


715759 


653 


717538 


715886 


717538 


£ C A 

654 


T 1 A 1 11 

719113 


720243 


719113 


f c r 

655 


T1 A C AA 

720590 


IaUAlL 


11 A£" AA 

720590 


656 


Tlj T il A/" 

722406 


TO *^ f\ C f 

723056 


TT1 A Ay* 

722406 


657 


723551 


723120 


723551 


658 


724246 


723626 


724246 


659 


724754 


724251 


724754 


660 


725868 


724900 


725868 


661 


727115 


726270 


727115 


662 


728126 


i « /-v 

727119 


728126 


663 


728594 


TO O^AO 

728208 


T"* or" A A 

728594 


664 


729614 


TO O /*A yl 

728604 


729614 


665 


T/% /"VT^O 

729778 


729533 


729778 


666 


730149 


72975 l 


730149 


667 


730539 


730174 


730539 


668 


TO 1 

731983 


730598 


TO 1 AOO 

731983 


669 


732427 


TO 1 A A Z° 

731996 


TO T vl TT 

732427 


670 


732917 


732423 


732917 


671 


»7<% «*« C AO 

733598 


TO O O OA 

733320 


TO O AO 

733598 


672 


733869 


733492 


733869 


673 


734298 


733900 


734298 


674 


734858 


734319 


734858 


675 


735195 


734863 


735195 


una 

676 


735578 


735342 


TO C CTO 

735578 


677 


735861 


735604 


735861 


678 


736492 


736079 


736492 


679 


737192 


736524 


737192 


680 


737555 


737211 


737555 
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ORFNos 


begin 


sad- I 


potential start 


681 


738688 


737837 


738688 


682 


739048 


738713 


739048 


683 


739736 


739065 


739736 


684 


740477 


mm 


740477 


685 


740659 


740958 


740659 


686 


741722 


740721 


741722 


687 


742789 


741827 


742789 


688 


743618 


742782 


743618 


689 


744092 


743634 


744092 


690 


744604 


744107 


744604 


691 


744953 


744498 


744953 


692 


746608 


744986 


746608 


693 


747085 


746621 


747085 


j 694 


747974 


747219 


747974 


h 695 


748594 


748169 


748594 


| 696 


749145 


748573 


749145 


! 697 


749652 


749957 


749652 


1" 698 


750446 


749979 


750446 


|~ 699 


751219 


750446 


751219 


700 


753042 


751291 


753042 


|~ 701 


754309 


753020 


754309 


j - 702 


755 12C 


I 756175 


755120 


j 703 


75612C 


) 756485 


i 756120 


70^ 


\ 75649' 


) 76022' 


1 756499 


j 70; 


> 76121' 


7 76029' 


7 761178 


f 70< 


5 76129 


7 76180 


9 761330 


P 70 


7 76178 


2 76228 


2 761782 


r 70 


8 76226 


0 76289 


5 762299 


r 70 


9 76286 


7 76331 


6 762867 


r 71 


0 76378 


,0 76332 


»5 763 /80 




1 763 8( 


A 765 1( 


58 763861 


r 71 


12 76681 


)9 7656! 


)7 766809 


1 7 


13 7680. 


51 7668 


88 76805 F 


r 7 


14j 7685 


66 7683 


21 768566 
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• f;;V.O^ : :Nb$;': : ;' : 


begin 




^potential start •, 


715 


769342 


768551 


769342 


716 


770532 


AO TO 

769378 


770532 


717 


771451 


770804 


771451 


718 


773058 


77 1 847 


773058 


719 


773094 


773456 


773094 


720 


774376 


773093 


774376 


721 


775123 


n vt o o/\ 

774380 


775123 


722 


775398 


774916 


*7*J f 1 AO 

775398 


723 


775046 


776077 


775046 


724 


776070 


777041 


776070 


725 


777964 


777536 


777964 


726 


778176 


777904 


778176 


727 


778621 


779334 


778684 


728 


781173 


780307 


781173 


729 


781526 


781116 


781526 


730 


782784 


781555 


782784 


731 


783572 


TOO, O AC 

782805 


783572 


732 


785032 


783581 


785032 


733 


786412 


*7 O O /"A 

785360 


786412 


734 


788429 


786450 


too ^ in 

788429 


735 


788944 


788528 


788944 


736 


789758 


1 O OA A 1 

788901 


789758 


737 


790332 


791504 


790338 


738 


791846 


79272 l 


791846 


739 


792724 


793569 


792724 


740 


793580 


794323 


793580 


741 


794304 


794843 


794304 


742 


795217 


795732 


795217 


743 


795722 


796795 


795722 


HA A 
/44 


798735 


/y /Uj3 


798735 


745 


799823 


798681 


799823 


746 


799297 


799578 


799297 


747 


801313 


799808 


801313 


748 


802453 


801332 


802453 



WO 99/27105 
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ORFNos 


begin . | 


end \ 


potential start 


749 


803299 


802457 


803299 


750 


803811 


803290 


803811 


751 


805151 


803826 


805151 


752 


805860 


805156 


805860 


753 


806604 


806332 


806604 


~~ 754 


806913 


806608 


806913 


755 


808222 


806903 


808222 


756 


808751 


808146 


808751 


757 


809437 


808673 


809437 


758 


809939 


809454 


809939 


759 


811235 


810213 


811235 


760 


811779 


813056 


811779 


761 


812890 


812516 


812890 


j 762 


812954 


813583 


812954 


j 763' 


813587 


815023 


813587 


J 764 


815420 


815746 


815420 


r~ 765 


816036 


817010 


816036 


j 766 


817111 


817356 


817111 


\ 767 


817791 


818609 


817797 




818609 


819094 


818609 


j 76S 


) 819104 


819823 


819104 


j 77( 


) 820722 


81982( 


> 820722 


1 77 


L 8223K 


\ 82100( 


) 822313 


77 


2 82350: 


I 82223 


g 823503 


j 77 


3 82367 


8 82561 


2 823678 


j 77 


4 82546 


1 82631 


2 825461 


| 77 


5 82728 


0 82664 


5 827280 




r6 8286C 


>4 82711 


i\ 828604 


r 7 


77 8300; 


16 82871 


13 830026 


7* 


78 8310' 


H 83001 




7 


79 8317 


25 8310 


51 831725 


7 


80 8322 


20 8330 


98 832220 


7 


81 8338 


51 8333 


96 833851 




'82 8340 


68 835C 


,39 834068 



WO 99/27105 
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J: -ORF Nos:. 


begin , 


. end | potential start 


783 


835792 


835127 


835792 


784 


837624 


836116 


837624 


785 


838951 


840882 


838951 


786 


840869 


842185 


840869 


787 


841989 


843455 


841989 


788 


843242 


844021 


843242 


789 


845018 


843987 


844997 


790 


846174 


844990 


846174 


791 


848509 


846311 


848509 


792 


848568 


849014 


848568 


793 


849082 


850488 


849088 


794 


851512 


850574 


851512 


795 


852064 


852447 


852064 


796 


852398 


853690 


852398 


797 


855118 


854243 


855118 


798 


855751 


855128 


855751 


799 


856551 


855829 


856551 


800 


856730 


858556 


856730 


801 


858717 


859601 


858717 


802 


859591 


860205 


859591 


803 


861132 


860284 


861132 


804 


861426 


861163 


861426 


805 


861701 


862921 


861701 


806 


863026 


864798 


863026 


807 


864831 


865256 


864831 


808 


865226 


866581 


865226 


809 


866562 


867119 


866562 


810 


867025 


867816 


867025 


811 


867820 


868497 


867820 


812 


869743 


868661 


869743 


813 


870633 


870094 


870633 


814 


871929 


870646 


871929 


815 


872538 


872086 


872538 


816 


873908 


872517 


873908 



WO 99/27105 
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ORF Nos 



817 



874281 



818 



874582 



819 



877857 



820 



878446 



821 



880635 



822 



882524 



874670 



875286 



875377 



879255 



879268 



potential start 



880593 



823 



882612 



824 



884155 



825 



884340 



826 



885722 



827 



887587 



883319 



883538 



88561 1 



887302 



888153 



828 



888627 



829 



889330 



830 
"8311 



889898 



891190 



832 



891828 



833 



892421 



834 



893116 



835 



892521 



836 



893392 



837 



895745 



838 



896668 



839 



897565 



888220 



888716 



889323 



889898 



891247 



892017 



892421 



892925 



895419 



896527 



897558 



899442 



874281 



874582 



877857 
878446 



880635 
882524 



882612 



884155 
884343 



885722 



887587 



888627 



889330 



889898 



891190 



891828 
892421 



893116 
892521 



893392 
895745 



896668 
897565 



840 


899420 


900229 


899420 


841 


903230 


900237 


903230 


842 


905081 


903234 


905081 


843 


906931 


905045 


906931 


844 


907248 


907832 


907299 


845 


907784 


908128 


907784 


846 


908132 


908677 


908132 


847 


908589 


909320 


[ 908589 


84* 


909405 


i 911465 


| 909405 




) 91 167 ' 


1 912360 


J 911725 


85( 


) 91230: 


] 912821 


912303 



PCT/IB98/01890 
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begin 


:end 


potential start 


851 


91 2937 


913983 


A 1 OA! ~1 

912937 


852 


915128 


914067 


915128 


853 


916658 


A 1 COAT 

915303 


916658 


854 


915627 


915376 


915627 


o c c 

855 


917707 


ai /TOCO 


y\ 1 /U / 


Of/ 

856 


918837 


917722 


A 1 OOn 

918837 


857 


919868 


/-v i oon 

918837 


A 1 AO/TO 

919868 


858 


920434 


919880 


920434 


859 


921187 


920438 


921187 


860 


921959 


921195 


92 I 959 


861 


923773 


921995 


923773 


862 


922146 


922415 


922146 


863 


923943 


923674 


923943 


864 


924077 


925006 


/•v j Ann 

924077 


865 


925436 


925083 


925436 


866 


926524 


925349 


926524 


867 


927920 


926433 


927920 


868 


928319 


92795 1 


928319 


869 


928963 


928334 


928963 


870 


929248 


930987 


/A ^ AI 40 

929248 


871 


930995 


932059 


930995 


872 


932121 


933515 


932175 


873 


932881 


932513 


932881 


874 


933485 


935746 


933485 


875 


935724 


937082 


935724 


876 


937229 


938410 


937229 


877 


938281 


938805 


938281 


878 


938809 


939255 


938824 


879 


939165 


939782 


939165 


OOA 

880 


939760 


940791 


939790 


881 


940822 


941106 


940822 


882 


940977 


941351 


940977 


883 


942537 


941623 


942429 


884 


942784 


942500 


942763 
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ORF Nos 


begin 




potential start : 


885 


943149 


942799 "~ 


943149 


" 886 


943799 


943029 


943799 


887 


944055 


943732 


944055 


888 


944413 


943994 ~~ 


944404 


889 


945395 


944556 


945395 


890 


945853 


945389 


945853 


891 


946392 


945751 


946392 


r" 892 


947410 


948081 


947431 


893 


949871 


948915 


949871 


r 894 


951058 


949868 


951058 


r" 895 


951249 


950959 


951249 


r~ 896 


951664 


952134 


951664 


897 


952674 


952165 


952674 


P 898 


953491 


952589 


953491 




899 


955324 


953495 


955324 




900 


955823 


955281 


955823 


r 901 


957082 


955847 


957082 


| 902 


957902 


957270 


957902 


903 


959231 


957906 


959231 


J 904 


959376 


960284 


959376 


r~ 905 


960266 


961669 


960347 


[ 90( 


, 96185( 


> 96476f 


, 961856 


| 90' 


1 96685! 


j 96539! 


> 966855 


90S 


j 96820' 


% 96697 


5 968204 




90 


9 96879 


1 96823 


7 968791 




91 


0 96949 


8 96873 


1 969498 




91 


1 96985 


8 96951 


1 969858 




91 


2 97011 


8 96976 


a 970118 




91 


3 97055 


>3 9703( 


)0 970593 




91 


14 9712< 


>1 9705< 


I2 y/i/oi 




9 


15 9716 


SO 9711 


23 971680 




9 


16 9718 


76 9751 


00 971876 




9 


17 9754 


19 9765 


16 975419 




1 


18| 9765 


84 9783 


20 976584 
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: ORF N os • 


begin • 


: : ;:; : :::;::.,end : ; 


potential start 


919 


977680 


977231 


mn / oa 

977680 


920 


978399 


AO AT> O 

980738 


AOOO A A 

978399 


921 


980756 


AO 1 AOO 

981928 


A O A*7C 

980756 


aoo 

922 


982974 


AO 1 AO 1 

981931 


982962 


923 


r\ o /t 1 OA 

984120 


no i i a 

9831 19 


AO >l 1 

984120 


924 


AO C C AO 

985502 


AO A 1 OA 

984120 


AO C C AO 

985502 


925 


987180 


985882 


no *7 i o/\ 

987180 


926 


987172 


987444 


987172 


927 


A n Art y" 

989846 


989049 


989846 


928 


991048 


989846 


991048 


929 


991638 


990955 


AA 1 ✓* O 

991638 


930 


991794 


992498 


991794 


931 


993619 


993041 


993619 


932 


993530 


994792 


993548 


933 


995970 


994795 


995970 


934 


996857 


995739 


996857 


935 


997603 


996782 


997603 


936 


998969 


r\ at\ « » m a 

997572 


/-v *r\ ri /~v 

998969 


937 


998896 


1000023 


998896 


938 


1000087 


1001340 


1000087 


939 


1001357 


1001818 


1001357 


940 


1003288 


1001873 


1003288 


941 


1003487 


1004146 


1003496 


942 


1004485 


1005639 


1 f\f\ A S O C\ 

1004689 


943 


1005643 


1005972 


1005643 


944 


1006784 


1006116 


1006784 


945 


1007563 


1006769 


1007563 


946 


1009226 


1007568 


1009226 


947 


1009989 


1009336 


1009989 


948 


i A« e ctc^ 

1015852 


1 A 1 ✓'O O 'I 

1016337 


1 A 1 C O C O 

1015852 


949 


1016561 


1016181 


1016561 


950 


1016297 


1017532 


1016297 


951 


1016802 


1016452 


1016802 


952 


1018993 


1017701 


1018993 
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ORF^os 


begin 


end 


potential start 


953 


1019454 


1019137 


10194541 


954 


1020764 


1019562 


1020764 


955 


1021405 


1021037 


1021405 


956 


1021821 


1024286 


1021821 


957 


1024697 


1024248 


1024697 


958 


1025569 


1024508 


10255511 


959 


1026969 


1025590 


102696"9| 


~~ 960 


1027789 


1026947 


1027789 


961 


1031199 


1027945 


1031199 


962 


1031717 


1031172 


1031717 


963 


1033057 


1031612 


1033057 


964 


1033425 


1033039 


1033425 


965 


1033784 


1033200 


1033784 


966 


1033963 


1036038 


10339631 


967 


1036945 


1036010 


1036945 


968 


1037110 


1037679 


1037110 


969 


1037696 


1037944 


1037696 


J 970 


1038916 


1037975 


1038916 


971 


1040582 


1039026 


1040582 


972 


1040997 


1042337 


1040997 


973 


1042357 


1043403 


1042357 


974 


I 104336 - / 


1044622 


1043367 


97J 


> 104460' 


1 104536: 


> 1044607 


97< 


5 104538' 


t 1046531 


} 1045384 


97' 


7 104644' 


7 104751' 


7 1046447 


97 


8 104752 


1 104995 


6 1047521 


97 


9 105061 


1 105003 


6 1050611 


98 


0 105092 


5 105056 


6 1050925 


98 


1 105172 


8 105109 


0 1051728 


95 


12 105174 


105206 




9i 


$3 1052K 


)1 10531: 


16 1052101 


9! 


34 10542( 


)1 1053H 


)7 1054201 


9 


85 10542- 


42 10555 


55 1054242 


9 


86 10554 


83 10559 


08 1055483 
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;;:fORFNas : - 

y;.: :: ;:.v : - ■ • : - ' • 


. begin : 


> 

-v:::;;-:;:endi; : : ; ;v,:-^. 


potential' start 


987 


1056609 


1056965 


1 A C f t~ (\f\ 

1056609 


988 


1056961 


1 AC O^O^ 

1058232 


1056985 


989 


1058238 


1058687 


1 AC OOI O 

1058238 


AAA 

990 


1059371 


1 AC OT>T 

1058727 


1 AC AO*7 1 

1059371 


991 


1 Af AC 

1059526 


1 A/T AC "7 O 

1060578 


1059526 


aao 

992 


1A/1 C C*} 

1061553 


1060579 


1061553 


993 


1061674 


1062411 


1061674 


994 


1062377 


1 AX* /I /\^^ 

1064077 


1062377 


995 


1064116 


1065243 


1064116 


996 


1067451 


1065178 


1067451 


997 


1068065 


1067376 


1068065 


998 


1 f\ f Cl f\ f\(\ 

1068209 


1068706 


1068230 


999 


1069958 


1068819 


1069958 


1000 


1071163 


1070033 


1071163 


1001 


1072438 


l 071332 


1072438 


1002 


1072997 


1073476 


1072997 


1003 


1074239 


1075864 


1074239 


1004 


1076790 


1075867 


1076790 


1005 


1 ATf^Z" O 

1077268 


1076573 


1077268 


1006 


1 AT1AAA 

1077999 


1078724 


1077999 


1007 


1079088 


1078672 


1079088 


1008 


1079642 


1079944 


1079642 


1009 


1080501 


1079995 


1080468 


1010 


1080775 


1 A O t ^ A 1 

1081341 


1 AO ATI C 

1080775 


1011 


1083158 


1 AD 1 T CA 

1081350 


1 AO! 1 CO 

1083158 


1012 


1 Art i /«n 

1084677 


1083235 


1 AO A Z*T7 

1084677 


1013 


1085648 


1084632 


1085648 


1014 


1086117 


1086737 


1086117 


1015 


1086692 


1087897 


1086692 


i a i a 

1016 


1088646 


1 AOAAAC 

1089005 


1088646 


1017 


1089146 


1089805 


1089146 


1018 


1092931 


1089890 


1092931 


1019 


1093179 


109288S 


i 1093179 


1020 


1093584 


1094204 


1093584 
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ORFNos 


begvn 


end ■ 


potential start 


1021 


1095619 


1094192 


1095619 


1022 


1096074 


1096628 


1096074 


1023 


1096633 


1097082 


1096633 


1024 


1097266 


1097601 


1097266 


1025 


1097622 


1097867 


1097622 


1026 


1097886 


1098392 


1097886 


1027 


1099521 


1099279 


1099521 


1028 


1099689 


1101053 


1099704 


1029 


1102192 


1101107 


1102192 


1030 


1104950 


1102116 


1104950 


1031 


1106508 


1104946 


1106508 


1032 


1106722 


1107249 


1106722 


1033 


1 107463 


1108101 


1107463 


1034 


1108041 


1108421 


1108041 


1035 


1108520 


1113370 


1108520 


1036 


1114958 


1113447 


1114958 


1037 


1116915 


1115071 


1116915 


1038 


1118183 


1116894 


1118183 


1039 


1118846 


1120030 


1118846 


1040 


1120040 


1 120522 


1120040 


1041 


1120510 


1121430 


1120510 


1042 


1121321 


1121866 


1121321 


104: 


\ 1122122 


( 112289S 


) 1122123 


10* 


» 112484: 


I 11255& 


t 1124842 


104 


5 112652< 


5 112557! 


) 1126526 


104 


6 112651 


9 112767 


6 1126519 


104 


7 112767 


2 112857 


1 1127672 


104 


8 113023 


0 113133 


6 1130230 


104 


9 113148 


0 113255 


3 1131480 


105 


0 113283 


0 113384 




lOf 


il 113412 


»1 113485 


,5 1134121 


io: 


32 U34fr 


12 11355* 


)2 1134642 


10. 


53 11359* 


54 113561 


53 1135964 


10 


54 U371 


32 11359 


54 1137132 
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1 ORFNos 


begin 


end : 


potential start 


1055 


H37169 


1140102 


1 1 O *7 1 £.t\ 

1137169 


1056 


l 141365 


1 1401 12 


1 1 A 1 O A A 

1 141344 


1057 


H42150 


1 1 Ml Of / 

1141356 


1 142150 


1058 


1142520 


1 145660 


1 142520 


1 AC A 

1 059 


1 145627 


1 1 /UTT^ 1 

1 146721 


1 145627 


1 060 


1 146862 


1147545 


1 1 yl/ OS'S 

1 146862 


1061 


1147666 


H48190 


1147666 


1 062 


1148514 


l 148224 


1148514 


1063 


1149136 


1148348 


1149136 


1064 


1 149702 


1149166 


1 149702 


I065 


1150031 


1150591 


1150031 


1066 


1150785 


1151147 


1150785 


1067 


1151165 


1152181 


1151165 


1068 


1152522 


1154591 


1152522 


1069 


1155666 


1154566 


1155666 


1070 


1156743 


1155670 


1156740 


1071 


1156859 


1157815 


1156859 


1072 


1157982 


1160735 


1157982 


1073 


1162620 


1160917 


1162620 


1 074 


1162970 


1162590 


1162970 


1075 


1163532 


1164020 


1163532 


1076 


1163995 


1164294 


1163995 


1 077 


1165569 


l 165030 


1165569 


1 078 


1166108 


1165566 


1166108 


1 079 


1166644 


1166141 


1166644 


1080 


1167055 


1168374 


1167055 


1081 


1169218 


1168337 


1169218 


1082 


1169823 


1169218 


1169823 


1083 


1171324 


1170572 


1171324 


1084 


1172085 


11111 n 

1171177 


1 172085 


1085 


1172394 


1173773 


1172394 


1086 


1175209 


1173881 


1 175209 


1087 


1175555 


1175127 


1175360 


1088 


1175778 


1177043 


1175778 
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ORFNos 


begin | 


end 


potential start 


1089 


1177177 


1179048 


1177177 


1090 


1179156 


1180085 


1179156 


1091 


1180045 


1180779 


1180045 


1092 


1181942 


1180788 


1181942 


1093 


1182296 


1181961 


1182296 


1094 


1183844 


1182300 


1183844 


^ 1095 


1184420 


1183848 


1184420 


1096 


1185382 


1184366 


1185382 


1097 


1185858 


1185226 


1185858 


1098 


1186164 


1186481 


1186185 


1099 


1187386 


1186484 


1187386 


1100 


1187370 


1189028 


1187370 


1101 


1189321 


1190889 


1189321 


\~~ 1102 


1191142 


1192146 


1191142 


1103 


1191974 


1191729 


1191974 


p 1104 


1193815 


1192991 


1193815 


[~~ 1105 


1195702 


1194248 


1195702 


j 1106 


1196303 


1195716 


1196303 


r~ iio7 


1196831 


1196337 


1196831 


J 1108 


1 197807 


1196746 


1197651 


1109 


i 1198740 


1197883 


1198668 


[ 111C 


> 1200232 


1198721 


1200232 


1 1111 


I 120128< 


i 1200135 


i 1201286 


r~ in: 


I 120238< 


S 120125? 


) 1202350 


1 111 


3 120290 


1 120235< 


) 1202901 


1 111 


4 120416 


2 120281 


6 1204162 


j in 


5 120317 


7 120346 


4 1203177 




6 120502 


8 120418 


0 1205028 


1 111 


7 120639 


2 120487 


8 1206392 




18 120674 


\2 12060S 


;6 1206742 


1 11 


19 12078' 


12 12067: 


14 1207872 


1 11 


20 12088: 


>2 120781 


51 1208852 


n 


21 12105 


18 12097 


42 1210518 


1 11 


22 12107 


03 12114 


94 1210703 
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ORFNos 


begin 


end 


! .. potmtiali sta#;£: 


1123 


1211870 


1212754 


t ai in *7/\ 

121 1870 


1124 


1212742 


1214064 


1212742 


1125 


1214046 


1214858 


1214046 


! 1126 


1215551 


1216318 


1 O 1 C C C 1 

1215551 


1127 


1216493 


1216849 


1216493 


1128 


1217183 


1219612 


1217183 


1129 


1220068 


1219673 


1220068 


1130 


1219710 


1220669 


1219710 


1131 


1220630 


1221376 


1220630 


1132 


1221645 


1223681 


1221645 


1133 


1223894 


1224988 


1223900 


1134 


1225000 


1225830 


l 22 5 000 


1135 


1227810 


1225879 


1227810 


1136 


1226528 


1226908 


1226528 


1137 


1229972 


1228311 


1229972 


1138 


47569 


47018 


47569 


1139 


49980 


49117 


49980 


1140 


53356 


52898 


53356 


1141 


54477 


54884 


54477 


1142 


63753 


63998 


63753 


1143 


77164 


77487 


77164 


1144 


79724 


79302 


79724 


1145 


88721 


88951 


88721 


1146 


94067 


94429 


94067 


1147 


122832 


123341 


122832 


1148 


147536 


147234 


147536 


1149 


158990 


159346 


158990 


1150 


168470 


168979 


168470 


1151 


169183 


169452 


169204 


1 152 


171785 


171504 


1/1 /05 


1153 


172518 


171775 


172518 


1154 


193599 


194045 


193599 


115! 


195704 


1 19607f 


195704 


nst 


i 21068 1 / 


210145 


i 210684 
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. OREjNos 


begin 




potential start 


1157 


211100 


210708 


211100 


" 1158 


215420 


215088 


215420 


1159 


217914 


218246 


217914 


r 1160 


218925 


218701 


218925 


j 1161 


223785 


223525 


223785 


J" 1162 


224271 


223999 


224271 


j 1163 


228691 


228407 


228691 


j 1164 


235050 


235334 


235050 


| 1165 


252308 


253021 


252308 


i 1166 


258280 


258912 


258280 


j 1167 


261325 


261567 


261325 


\ 1168 


268195 


268878 


268195 


j 1169 


269447 


268881 


269447 


j 1170 


271263 


271538 


271263 


j 1171 


271957 


272346 


271957 


j 1172 


274176 


274550 


274176 


r 1173 


275736 


275314 


275736 


h~ 1174 


276490 


276927 


276490 


|" 1175 


277577 


277861 


277577 


1176 


288163 


287909 


288163 


j - 1177 


290130 


289789 


290130 


| 1178 


29098S 


i 291225 


290989 


i 1175 


i 291372 


! 29186C 


> 291372 


118( 


> 31123S 


) 31162: 


\ 311239 


118' 


I 32866! 


j 32838' 


\ 328665 


r 118: 


I 33734! 


5 33828! 


} 337348 


118 


3 36476 


4 36436 


9 364764 


j 118 


4 38962 


3 39013 


5 389623 


118 


5 39372 


9 39434 


3 393729 


1 118 


6 40737 


9 40762 


1 hu / .> / y 


1 118 


7 41094 


4 41070 


8 410944 


] 118 


8 42763 


,2 4279? 


8 427632 


m 


!9 4281 - ; 


12 4284J 


56 428172 


n< 


)0 4367< 


31 4372' 


16 436761 
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begin . 




potential start; 


l iy i 


/I£A01 1 

40uyi i 


4o 1 1 jy 


/1£.AA1 1 

4ooy i i 


1 1 go 
1 iyl 


4 / /jy 1 


4 / /J 13 


/n*7CA7 
4 / /jy 1 


1 1 oi 




/1QOAA1 


HO fjVJj 


1 1 a a 
1 ly4 


4o / /04 


/i B7C.1/1 

4o /!)34 


4o/ /o4 




^VojUZ 


/loom 7 
4yyui / 




i iyo 


/1QQ7Q^ 


DUU400 


/10Q7QC 


1 i 07 

i iy / 


c*7 i OOQ. 

j/ iyzo 


j /Z344 


c*7 1 AO 0 

j / iyzo 


1 1 no 

1 iy© 


5 /23o/ 


5/2131 


J /ZiO/ 


1 1 aa 

1 199 


COO 1 O/l 

588184 


CC7A1 C 

587915 


COQ I O/l 

5oolo4 


1200 


ZAAC OT 

600587 


£ A AAA "7 

600907 


£AACO*7 

60058/ 


1 OA 1 

1201 


609/31 


/AOOfif 

60ooyj 


ouy 15 i 


1202 


614039 


£ 1 >ncf 

614/55 


£1/1 A*0 A 

6141)39 


1203 


£1/1 Ol 

614823 


615152 


£ 1 /I OO "5 

614823 


1 O A A 

1204 


638244 


638831 


£"5 00/1/1 

638244 


1205 


638819 


£0 AAA A 

639094 


638819 


i oa£ 
1206 


£0 AAT) 

639073 


639636 


639073 


1207 


£ A 1AA 1 

647901 


£ /I on £ 

648236 


£ /l*7 A A 1 

647901 


1208 


/nor i a 

678510 


£*7A/1£A 

o /y4oy 


£.70C 1 A 
6/oJ 1U 


1 1AH 

1209 


/oono 

688178 


688732 


6881 /o 


1 O 1 A 

1210 


696045 


£ A£ C£0 

696563 


£A£A/1C 

696045 


1211 


TAOAAO 

708998 


*7AO COO 

708588 


"7AO A AO 

/08yy8 


1212 


TAAO AO 

709808 


^ 1 AAO A 

/ 10089 


/09808 


1213 


O 1 A f\ 

7 1 8240 


717737 


*7 1 QO/I A 

/ 1 5Z4U 


1214 


T)nfi O O 

737828 


737565 


/3/oZo 


nic 

1215 


779502 


*7 O AOCO 

780257 


nnrjf AO 

/ /yjuz 


1 o 1 £ 

1216 


OA£0 1 A 

8063 1 0 


O AC 0£ vl 

805864 


OA/1 1 A 

81)63 10 


1 O 1 *7 

1217 


820931 


OO A*7A*7 

820707 


820931 


1218 


837696 


O AAAZT 

839096 


0*»"7£ A£ 

837696 


1219 


883307 


O O O C A f\ 

883549 


OOT1 AT 

883307 


1ZZU 


OA^AI A 


Q A 1 70£ 

oyl /ZO 


O07H1 n 


1221 


893277 


893564 


893277 


1222 


936998 


937225 


936998 


1223 


946865 


947419 


946865 


1224 


975187 


975411 


975187 
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ORFNos j 


begin 




potential start 


1225 


985882 


985517 


985882 


1226 


987713 


987180 


987713 


1227 


988215 


987733 


988215 


1228 


988754 


988530 


988754 


j 1229 


992542 


992841 


992542 


r" 1230 


992759 


993067 


992759 


r 1231 


1004247 


1004528 


1004268 


[ 1232 


1015013 


1014294 


1015013 


| 1233 


1056147 


1056545 


1056147 


1 1234 


1077682 


1078035 


1077682 


\ 1235 


1088121 


1088381 


1088121 


|~ 1236 


1098430 


1098852 


1098430 


[ 1237 


1098798 


1099319 


1098798 


\~ 1238 


1123198 


1123515 


1123198 


r~ 1239 


1123606 


1124256 


1123606 


f 1240 


1124453 


1124797 


1124453 


[ 1241 


1129253 


1129567 


1129253 


\ 1242 


, 1164947 


1164474 


1164947 


! 1243 


1170457 


1170053 


1170457 


S 1244 


1172342 


1171863 


1172342 


[ 1245 


1192155 


1192835 


1192155 


\ 1246 


, 119275S 


» 1192992 


1192759 


| 124' 


r 1193861 


1194142 


! 1193861 


J 124$ 


5 119403< 


> 1 19377* 


) 1194036 




124« 


) 120974 


I 120905! 


5 1209748 




125 


0 121511 


1 121541' 


) 1215111 




125 


1 121630 


2 121653 


8 1216302 




125 


2 122807 


2 122781 


8 1228072 




125 


3 122830 


4 122808 


0 1228304 




125 


4 2659 


•9 2622 


2 26599 




j 125 


,5 276C 


>9 2736 


,7 27609 




I 12f 


>6 672( 


)6 669( 


Yl 67197 




r 12. 


57 706 


12 7031 


52 70588 




12 


58 13271 


33 1329' 


45 132703 
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begin 


;--;:enU,::;:; : :;Xv 


: potential Starr 




1 /oO/J 


1 /8393 


1 7QAT1 

1 /o0/3 


1 O/CA 


20oj /o 


2U8349 


2085 /o 




OAO 1 ^£ 


OAOOOQ 

zUo9zy 


OAQt C£ 

zoy i jo 


1Z0Z 


zoyzoi 


O. AAAO A 

209024 


Z09zo3 


1 0<1 
IZOj 


Z 1UJW 




O 1 A1A/J 

Z WW* 


IZtrf 


IQGAAG 




0QOA1A 


1265 


1C11 A£ 

352106 


35171 / 


O COA/C 1 

352061 


1266 


420182 


vi i A C\A A 

419949 


/IT A 1 *7A 

4201 /0 


1267 


553602 


553381 


C C*} /L AO 

553602 


1 O ZT O 

1268 


556538 


C C £. O AT 

556807 


r r /CO o 

556538 


1269 


594348 


f Al TAT 

593 79 / 


CA/I O /IO 

594342 


1270 


c nc i iCn 

595169 


en/107/: 
5948 /6 


C A C 1 /I A 

595 160 


1 OT 1 

1271 


662148 


662381 


662 1 60 


1 OTO 

1272 


706528 


706893 


TA/CCO O 

706528 


1273 


o ao itc 

803315 


O AO /CA 

803650 


oao o o a 

803339 


1 OT /I 

1274 


849551 


849306 


O A AC C 1 

849551 


1 OTC 

1275 


913676 


ai o nc 

913275 


913676 


i ot/: 

1276 


927087 


AO OO ZT 

926836 


AOTAOT 

927087 


1277 


930587 


AO AO £. A 

930360 


no AC OT 

930587 


i no 

1278 


986531 


AOZTTZT /I 

986764 


986531 


1279 


aaz" oo a 

996229 


ftft/ AO £ 

996486 


fkA^OO A 

996229 


1 i o a 

1280 


1000373 


1 AAAAAO 

1 000002 


1 AAAO O /I 

1000334 


1281 


1010291 


1 A1 AAOT 

1010037 


1 A 1 AOTO 

10102/3 


1282 


i a 1 i no 
101 1 128 


1 A 1 AT no 

1010793 


1 AI 1 1 OO 

101 1 128 


1 ooo 

1283 


1 A 1 1A1>1 

1012924 


1 A 1 O/T A/I 

l 012694 


1 A 1 O AO A 

1012924 


1284 


1 AO OzTCA 

1028659 


1 AO OA 1 O 

1028913 


1 AOOZTC A 

1028659 


1285 


1 AO £. AO \ 

1086481 


1 AOZTT^O 

1086762 


1 AO/C A O 1 

1086481 


1286 


1 118658 


1 1 1 O O^A 

1 118879 


111 onco 

1 1 18658 


1287 


1 170098 


\ 1169835 


1 1 1AAA O 

1 170098 


lzoo 


1 1 OAOOQ 

1 isUoze 


11011 Q.A 


1 loUoZo 


1289 


1182658 


1183035 


1182658 


1290 


1195076 


1194795 


1195055 


1291 


1195890 


1196183 


1195890 


1292 


189042 


188809 


189030 
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ORFNos 


begin 


end 


potential, start: < 


1293 


691250 


Oy IjO/ 


691250 


1294 


914544 


914780 


914556 


1295 


928525 


928833 


928579 


1296 


1040685 


1040948 


1040712 


1297 


377646 


378068 


377646 
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Table 4 



SEQIDNO(ORF) 


Fp 


Fd 


Bp 


Bd 


2 


1292 


1293 


3796 


3797 


3 


1294 


1295 


3798 


3799 


4 


1296 i 


1297 


3800 


3801 


5 


1298 


1299 


3802 


3803 


6 


1300 


1301 


3804 


3805 


7 


1302 


1303 


3806 


3807 


8 


1304 


1305 


3808 


3809 


9 


1306 


1307 


3810 


3811 


10 


1308 


1309 


3812 


3813 


11 


1310 


1311 


3814 


3815 


12 


1312 


1313 


3816 


3817 


13 


1314 


1315 


3818 


3819 


14 


1316 


1317 


3820 


3821 


15 


1318 


1319 


3822 


3823 


16 


1320 


1321 


3824 


3825 


17 


1322 


1323 


3826 


3827 


18 


1324 


1325 


3828 


3829 


19 


1326 


1327 


3830 


3831 


20 


1328 


1329 


3832 


3833 


21 


1330 


1331 


3834 


3835 


22 


1332 


1333 


3836 


3837 


23 


1334 


1335 


3838 


3839 


24 


1336 


1337 


3840 


3841 


25 


1338 


1339 


3842 


3843 


26 


1340 


1341 


3844 


3845 


27 


1342 


1343 


3846 


3847 


28 


1344 


1345 


3848 


3849 


29 


1346 


1347 


3850 


3851 


30 


1348 


1349 


3852 


3853 


31 


1350 


1351 


3854 


3855 


32 


1352 


1353 


3856 


3857 


33 


1354 


1355 


3858 


3859 


34 


1358 


1359 


3862 


3863 
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35 


1356 


1357 


3860 


1Q£ 1 

3ool 


36 


1360 


1361 


3864 


3o05 


37 


1362 


1363 


3866 


350/ 


38 


1364 


1365 


3868 


joOy 


39 


1366 


1367 


3870 


JO / 1 


40 


1368 


1369 


3872 


JO / J 


41 


1370 


1371 


3874 


^87^ 
JO / J 


42 


1374 


1375 


3878 


3o/y 


43 


1376 


1377 


3880 


3ool 


44 


1380 


1381 


3884 


1CQC 


45 


1382 


1383 


3886 


3oo/ 


46 


1386 


1387 


3890 


joy l 


47 


1388 


1389 


3892 


3oV3 


48 


1392 


1393 


3896 


Soy 1 


49 


1394 


1395 


3898 


3899 


| 50 


1396 


1397 


3900 


3901 


51 


1398 


1399 


3902 


*i ft/11 

3903 


52 


1402 


1403 


3906 


390/ 


53 


1400 


1401 


3904 


3905 


54 


1404 


1405 


3908 


TAAft 

39Uy 


55 


1406 


1407 


3910 


391 1 


56 


1410 


1411 


3914 


3915 


57 


1412 


1413 


3916 


3^1/ 


58 


1414 


! 1415 


3918 


3y iy 


59 


1416 


1417 


3920 


3yzi 


60 


1418 


1419 


3922 


3923 


61 


1420 


1421 


3924 


3925 


62 


1422 


1423 


3926 


3927 


63 


1424 


1425 


3928 


3929 


64 


1426 


1427 


3930 


3931 


65 


1 /IOC 




| 3932 


3933 


66 


1430 


1431 


3934 


3935 


67 


1432 


1433 


3936 


3937 


! 68 


1434 


1435 


3938 


3939 


69 


1438 


1439 


3942 


3943 



WO 99/27105 



177 



PCIYIB98/01890 



70 


1440 


1441 


3944 


3945 


71 


1444 


1 A A C 

1445 


3948 ; 


3949 


72 


1446 


1447 


3950 


3951 


73 


1448 


1449 


3952 


3953 


74 


1450 


1451 


3954 


3955 


75 


1452 


1453 


3956 


3957 


76 


1 A C A 

1454 


1455 


3958 


3959 


77 


1456 


1457 


3960 


3961 


78 


1458 


1459 


3962 


3963 


79 


1460 


1461 


3964 


3965 


i 80 


1462 


1463 


3966 


3967 


81 


1464 


1465 


3968 


3969 


82 


1468 


1469 


3972 


3973 


83 


1470 


1471 


3974 


3975 


84 


1472 


1473 


3976 


3977 


85 


1476 


1477 


3980 


3981 


86 


1478 


1479 


3982 


3983 


87 


1480 


1481 


3984 


3985 


88 


1482 


1483 


3986 


3987 


89 


1484 


1485 


3988 


3989 


90 


1486 


1487 


3990 


3991 


91 


1488 


1489 


3992 


3993 


92 


1490 


1491 


3994 


3995 


93 


1492 


1493 


3996 


3997 


94 


1494 


1495 


3998 


3999 


95 


1496 


1497 


4000 


4001 


96 


1498 


1499 


4002 


4003 


97 


1500 


1501 


4004 


4005 


98 


1502 


1503 


4006 


4007 


99 


1504 


1505 


4008 


4009 


100 


1506 


1507 


4010 


4011 


101 


1508 


1509 


4012 


4013 


102 


1510 


1511 


4014 


4015 


103 


1512 


1513 


4016 


4017 


104 


1514 


1515 


4018 


4019 



PCT/1B98/01890 

WO 99/27105 

178 



105 


1516 


1517 


4020 


4UZ I 


106 


1518 


1519 


4022 




107 


1520 


1521 


4024 


4UZj 


108 


1522 


1523 


4026 




109 


1524 


1525 


4028 




110 


1526 


1527 


a m A 

4030 


Ami 


111 


1530 


1531 


4034 


H\J3J 


112 


1532 


1533 


4036 


4Uj l 


113 


1534 


1535 


4038 




114 


1536 


1537 


4040 


A(\A 1 
4U41 


r us 


1538 


1539 


4042 


A(\A1 

4U4.3 


116 


1540 


1541 


A A A A 

4044 




117 


1542 


1543 


4046 


4U4 / 


118 


1544 


1545 


4048 


4U4V 


119 


1546 


1547 


4050 


4UD 1 


120 


1548 


1549 


4052 


40j3 


121 


1550 


1551 


4054 


4Ujj 


122 


1552 


1553 


4056 


40 J / 


123 


1554 


1555 


4058 


A AC A 

40 jy 


124 


1556 


1557 


4060 


40ol 


125 


1558 


1559 


4062 


40 0 J 


126 


1562 


1563 


4066 


40o/ 


127 


1564 


1565 


4068 


aci&q 
4UD7 


128 


1566 


1567 


4070 


4U / 1 


129 


1568 


1569 


4072 


4U / j 


130 


1570 


1571 


4074 


4U / j 


131 


1572 


1573 


4076 


40/ / 


132 


1574 


1575 


4078 


40 fy 


133 


1576 


1577 


4080 


A AO 1 

4081 


134 


1580 


1581 


4084 


A AOC 

4085 


135 


IDoZ 


1583 


4086 


4087 


136 


1584 


1585 


4088 


4089 


137 


1586 


1587 


4090 


4091 


138 


1588 


1589 


4092 


4093 


139 


1590 


j 1591 


4094 


4095 
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140 


1592 


1593 


4096 


4097 


141 


1594 


1595 


4098 


4099 I 


142 


1596 


1597 


4100 


4101 


143 


1598 


1599 


4102 | 


4103 


144 


1604 


1605 


4108 j 


4109 


145 


1606 


1607 


4110 


4111 


146 


1612 


1613 


4116 


41 17 


147 


1614 


1615 


4118 


4119 


148 


1616 


1617 


4120 


4121 


149 


1618 


1619 


4122 


4123 


150 


1620 


1621 


4124 


4125 


151 


1624 


1625 


4128 


4129 


152 


1622 


1623 


4126 


4127 


153 


1626 


1627 


4130 


4131 


154 


1628 


1629 


4132 


4133 


155 


1630 


1631 


4134 


4135 


156 


1632 


1633 


4136 


4137 


157 


1634 


1635 


4138 


4139 


158 


1636 


1637 


4140 


4141 


159 


1638 


1639 


4142 


4143 


160 


1640 


1641 


4144 


4145 


161 


1642 


1643 


4146 


4147 


162 


1644 


1645 


4148 


4149 


163 


1646 


1647 


4150 


4151 


164 


1648 


1649 


4152 


4153 


165 


1650 


1651 


4154 


4155 


166 


1652 


1653 


4156 


4157 


167 


1656 


1 1657 


4160 


4161 


168 


1658 


1659 


4162 


4163 


169 


1662 


1663 


4166 


4167 


170 


1664 


1665 


4168 


4169 


171 


1666 


1667 


4170 


4171 


172 


1668 


1669 


4172 


4173 


173 


1670 


1671 


4174 


4175 


174 


1672 


1673 


4176 


4177 
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175 


1674 


1675 


4178 


41/9 


176 


1676 


1677 


4180 


A 1 O 1 

41ol 


177 


1678 


1679 


4182 


A 1 01 

41oi 


178 


1684 


1685 


4188 


a i on 
41oV 


179 


1686 


1687 


4190 


41V l 


180 


1688 


1689 


4192 


A 1 Q1 


181 


1690 


1691 


4194 


A 1 


182 


1694 


1695 


4198 


a i no 
4199 


183 


1696 


1697 


4200 


4ZU1 


184 


1698 


1699 


4202 


a 


185 


1700 


1701 


4204 


>IO AC 

4205 


186 


1704 


1705 


4208 




i 187 


1708 


1709 


4212 


4213 


188 


1710 


1711 


4214 


A ^ 1 C 

4215 


189 


1712 


1713 


4216 


A*\ 1 1 

4217 


190 


1714 


1715 


4218 


A*\ 1 (\ 

4219 


191 


1720 


1721 


4224 


4225 


192 


1722 


1723 


4226 


4227 


193 


1724 


1725 


4228 


4229 


194 


1726 


1727 


4230 


4231 


195 


1728 


1729 


4232 


4233 


196 


1732 


1733 


4236 


4231 ! 


197 


1734 


1735 


4238 


A no 

42iy 


198 


1736 


1737 


4240 


A^A 1 

4241 


199 


1738 


1739 


4242 


A1A1 

424J 


200 


1740 


1741 


4244 


A^i A C 

4245 


201 


1742 


1743 


4246 


4247 


202 


1744 


1745 


4248 


4249 


203 


1746 


1747 


4250 


4251 


204 


1750 


1751 


4254 


4255 


205 


1752 


1 / Jj 


4256 


4257 


206 


1754 


1755 


4258 


4259 


207 


1756 


1757 


4260 


4261 


208 


1758 


1759 


4262 


4263 


209 


1760 


1761 


4264 


4265 
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7 1 A 

210 


1 7£7 

1 /62 


1 /03 


4Zoo 


4Z0 / 


oil 
21 1 


1 /o4 


1 /OD 


4Z0o 




212 


1766 


1767 


A 77A 
42 /U 


/I77 1 

42/1 


213 


1 7/;o 
1 /6o 


1 7 /CO 

1 /oy 


/1777 

42/2 


/1777 

42/3 


*"» 1 A 

214 


1770 


1 T71 

1771 


42/4 


42 /D 


ZlD 


1 77') 
1 / /Z 


1 7*71 
1 / /3 


4Z /o 


>1777 
4Z / / 


216 


1 77/1 
1 / /4 


1 77C 
1 / ID 


,4770 

42 /o 


/177Q 

42 /y 


7 17 

217 


1 77£ 
1 / /6 


1 777 
1 / / / 


4ZoU 


/I70 1 

42ol 


218 


1 770 

1 / /o 


1 77A 

1 / /9 


4282 


/17 07 

4Z©3 


219 


1782 


1783 


4286 


;I1 oi 

4287 


220 


1784 


1785 


4288 


4289 


221 


1786 


1 TOT 

1787 


A OAA 

4290 


/I OA 1 

4291 


222 


1788 


1789 


4292 


4293 


111 

223 


1790 


1 T A 1 

1791 


/1 7 A/t 

4294 


4295 


224 


1792 


1793 


a on/ 

4296 


/I 1 A1 

4297 


225 


1796 


1797 


4300 


4301 


226 


1800 


1801 


4304 


4305 


227 


1802 


1803 


4306 


4307 


228 


1804 


1805 


4308 


4309 


nn 

229 


1806 


1807 


A "> 1 A 

4310 


431 1 


11 a 

230 


1 OAO 

1808 


1 OAA 

1809 


4312 


A O 1 O 

4313 


n 1 
231 


1 O 1 A 

1810 


1 O 1 1 

181 1 


4314 


/IT 1 C 

4315 


232 


1812 


ion 

1813 


A 1 1 ^" 

4316 


4317 


233 


1818 


1819 


4322 


/II 11 

4323 


234 


1824 


1825 


4328 


/1 11 A 

4329 


235 


1826 


1827 


A "5 1 A 

4330 


4331 


236 


1828 


1829 


>l 1 1 7 

4332 


/i 1 1 1 
4333 


237 


1 Ol A 

j 1 834 


1835 


A 1 1 O 

4338 


4339 


238 


1836 


i on 

1837 


/I 7 /I A 

4340 


4341 


239 


1840 


1841 


4344 


4345 


240 


1 0/17 

1842 


10/11 

1843 


/ti/i£ 
4340 


434 / 


241 


1846 


1847 


4350 


4351 


242 


1848 


1849 


4352 


4353 


243 


1850 


1851 


4354 


4355 


244 


1852 


1853 


4356 


4357 
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245 


1854 


1855 


A O C O 

4358 


Hjjy 


246 


1856 


1857 


4360 


4jD 1 


247 


1858 


1859 


43 6/ 




248 


I860 | 1861 


A1&A S 

4304 




[~~ 249 


1862 


1863 


43 oo 




|~ 250 


1864 


1865 


A1 £Q 

43oo 


AIM 


I 251 


1866 


1867 


43 /U 


A11\ 
Hj / 1 


[ 252 


1868 


1869 


4372 


Alii 
4 J / j 


J 253 


1872 


1873 


43 /o 




[~~ 254 


1876 


1877 


4380 


AIR} 


| 255 


1874 


1875 


4378 


4j 17 


[ 256 


1878 


1879 


4382 


A191 

4JOJ 


F 257 


1886 


1887 


4390 


A 701 

4jy i 


P 258 


1888 


1889 


4392 


A lOI 

43 y j 


\ 259 


1890 


1891 


4394 




260 


1892 


1893 


4396 


43V / 


r~ 26i 


1896 


1897 


4400 




262 


1894 


1895 


4398 


myy 


263 


1898 


1899 


4402 


AA(\1 


j 264 


1900 


1901 


4404 


AA(\S 

HH\JJ 


265 


1902 


1903 


4406 


AAfil 
44U / 


i 266 


1904 


1905 


4408 


44UV 


j 267 


1906 


1907 


4410 


AA\ 1 


r~ 268 


1908 


1909 


4412 


4*r 1 J 


[ 269 


1910 


1911 


4414 


4413 


h~ 270 


1912 


1913 


4416 


441 / 


J 271 


1914 


1915 


4418 


44 iy 


272 


1916 


1917 


4420 


44Z1 


\ 273 


1918 


1919 


4422 


AA^I 

44Z3 


| 274 


1920 


1921 


4424 


44 z j 


275 


1922 


1923 


4426 


4427 


r 276 


1924 


1925 


4428 


4429 


277 


1926 


1927 


4430 


4431 


! 278 


1928 


1929 


4432 


4433 


| 279 


1930 


1931 


4434 


4435 
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280 


1 ft "5 A 

1934 


i ft "> c 

1935 


4438 


A A "> ft 

4439 


281 


1936 


1 ftO "7 

1937 


4440 


A A A 1 

4441 


282 


1938 


1939 


4442 


4443 


283 


1940 


i c\ A i 

1941 


4444 


4445 


284 


1942 


1943 


4446 


4447 


285 


1944 


1 ft A C 

1945 


4448 


4449 


286 


i ft a n 

1946 


1 ft A *7 

1947 


4450 


4451 


287 


1948 


1949 


4452 


4453 


288 


1950 


1 ft C 1 

1951 


4454 


A A C C 

4455 


289 


1952 


1953 


4456 


4457 


290 


1954 


1955 


4458 


4459 


291 


1956 


1957 


4460 


4461 


292 


1958 


1959 


4462 


4463 


293 


1960 
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WHAT IS CLAIMED IS: 

1- An isolated polynucleotide having a nucleotide sequence of a Chlamydia pneumoniae 

genome, comprising 
5 (a) the a nucleotide sequence of SEQ ID No. 1 ; 

(b) the nucleotide sequence contained within the Chlamydia 
pneumoniae genomic DNA in ATCC Deposit No. ; 

(c) the nucleotide sequence contained in a clone insert in ATCC 
Deposit No. ; 

10 (d) a nucleotide sequence exhibiting at least 99.9% identity with the 

sequence of SEQ ID No. 1 ; or 
(e) a nucleotide sequence exhibiting at least 80% homology to SEQ 
ID No. 1. 

15 2- An isolated polynucleotide which hybridizes to SEQ ID No. 1 or to the Chlamydia 

pneumoniae genomic DNA contained in ATCC deposit No. or to a clone insert in 

ATCC Deposit No. under conditions of high stringency. 

3- An isolated polynucleotide which hybridizes to SEQ ID No. 1 or to the Chlamydia 
20 pneumoniae genomic DNA contained in ATCC deposit No. under conditions of 

intermediate stringency. 

4- An isolated polynucleotide having a nucleotide sequence of an open reading frame (ORF) 
of a Chlamydia pneumoniae genome, comprising: 

25 ( a ) a nucleotide sequence chosen from one of ORF2 to ORF 1 297; 

(b) a nucleotide sequence exhibiting at least 99.9% identity with 
one of ORF2 to ORF 1297; or 

(c) a nucleotide sequence exhibiting at least 80% homology to one 
of ORF2 to ORF 1297. 

30 

5- An isolated polynucleotide which hybridizes to one of ORF2 to ORF 1297 under 
conditions of high stringency. 

6- An isolated polynucleotide which hybridizes to one of ORF2 to ORF 1297 under 
35 conditions of intermediate stringency. 

7- The polynucleotide of Claims 2, 3, 4, 5, or 6 which encodes the following polypeptides or 
fragments thereof: 

(a) a Chlamydia pneumoniae transmembrane polypeptide having 
40 between 1 and 3 transmembrane domains; 
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a Chlamydia pneumoniae transmembrane polypeptide having 
between 4 and 6 transmembrane domains; 
a Chlamydia pneumoniae transmembrane polypeptide having at 
least 7 transmembrane domains; 

a Chlamydia pneumoniae polypeptide involved in intermediate 
metabolism of sugars and/or cofactors; 

a Chlamydia pneumoniae polypeptide involved in intermediate 
metabolism of nucleotides or nucleic acids; 
a Chlamydia pneumoniae polypeptide involved in metabolism 
of amino acids or polypeptides; 

a Chlamydia pneumoniae polypeptide having involved in 
metabolism of fatty acids; 

a Chlamydia pneumoniae polypeptide involved in the synthesis 
of the cell wall; 

a Chlamydia pneumoniae polypeptide involved in transcription, 

translation, and/or maturation process; 

a Chlamydia pneumoniae transport polypeptide; 

a Chlamydia pneumoniae polypeptide involved in the virulence 

process; 

a Chlamydia pneumoniae polypeptide involved in the secretory 
system and/or which is secreted; 

a Chlamydia pneumoniae polypeptide of the cellular envelope 
or outer cellular envelope of Chlamydia pneumoniae, 
a Chlamydia pneumoniae surface exposed polypeptide; 
a Chlamydia pneumoniae lipoprotein; 

a Chlamydia pneumoniae polypeptide involved in 

lipopolysaccharide biosynthesis; 

a Chlamydia pneumoniae KDO-related polypeptide; 

a Chlamydia pneumoniae phosphomannomutase-related 

polypeptide; 

a Chlamydia pneumoniae lipid A component-related 
polypeptide; 

a Chlamydia pneumoniae phosphoglucomutase-related 
polypeptide; 

a Chlamydia pneumoniae polypeptide that contains an RGD 
sequence; 

a Chlamydia pneumoniae Type III secreted polypeptide; 

a Chlamydia pneumoniae cell wall anchored surface 

polypeptide; or 
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(x) a Chlamydia pneumoniae polypeptide that is not found in 
Chlamydia trachomatis. 

8- A polynucleotide encoding a fusion protein, comprising one of ORF2 to ORF1297 of 
5 Claim 4, 5, or 6 ligated in frame to a polynucleotide encoding a heterologous polypeptide. 

9- A recombinant vector that contains the polynucleotide of Claim 1, 2, 3, 4, 5 or 6. 

10- A recombinant vector that contains the polynucleotide of Claim 8. 

10 U- A recombinant vector that contains the polynucleotide of Claim 4, 5 or 6, operatively 
associated with a regulatory sequence that controls gene expression. 

12- A recombinant vector that contains the polynucleotide of Claim 8 operatively associated 
15 with a regulatory sequence that controls gene expression. 

13- A genetically engineered host cell that contains the polynucleotide of Claim 1, 2, 3, 4, 5 or 
6. 

20 1 4- A genetically engineered host cell that contains the polynucleotide of Claim 8. 

15- A genetically engineered host cell that contains the polynucleotide of Claim 4, 5 or 6 
operatively associated with a regulatory sequence that controls gene expression m the host 
cell. 

25 16- A genetically engineered host cell that contains the polynucleotide of Claim 8 operatively 
associated with a regulatory sequence that controls gene expression in the host cell. 

17- A method for producing a polypeptide, comprising: 
30 (a ) culturing the genetically engineered host cell of Claim 1 5 under 

conditions suitable to produce the polypeptide encoded by the 
polynucleotide; and 
(b) recovering the polypeptide from the culture. 

35 1 8- A method for producing a fusion protein, comprising: 

(a) culturing the genetically engineered host cell of Claim 16 under 
conditions suitable to produce the fusion protein encoded by the 
polynucleotide; and 

(b) recovering the fusion protein from the culture. 
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19- A polypeptide encoded by the polynucleotide of Claim 4, 5 or 6. 

20- The polypeptide of Claim 19 which immunoreacts with seropositive serum of an 
5 individual infected with Chlamydia pneumoniae. 

21- The polypeptide of Claim 19 which comprises the following polypeptides or fragments 
thereof: 

(a) a Chlamydia pneumoniae transmembrane polypeptide having 
10 between 1 and 3 transmembrane domains; 

(b) a Chlamydia pneumoniae transmembrane polypeptide having 
between 4 and 6 transmembrane domains; 

(c) a Chlamydia pneumoniae transmembrane polypeptide having at 
least 7 transmembrane domains; 

15 (d) a Chlamydia pneumoniae polypeptide involved in intermediate 

metabolism of sugars and/or cofactors; 

(e) a Chlamydia pneumoniae polypeptide involved in intermediate 
metabolism of nucleotides or nucleic acids; 

(f) a Chlamydia pneumoniae polypeptide involved in metabolism 
20 of amino acids or polypeptides; 

(g) a Chlamydia pneumoniae polypeptide involved in metabolism 
of fatty acids; 

(h) a Chlamydia pneumoniae polypeptide involved in the synthesis 
of the cell wall; 

25 (i) a Chlamydia pneumoniae polypeptide involved in transcription, 

translation, and/or maturation process; 
(j) a Chlamydia pneumoniae transport polypeptide; 
(k) a Chlamydia pneumoniae polypeptide involved in the virulence 
process; 

30 (1) a Chlamydia pneumoniae polypeptide involved in the secretory 

system and/or which is secreted; 
(m) a Chlamydia pneumoniae polypeptide of the cellular envelope 

or outer cellular envelope of Chlamydia pneumoniae. 
(n) a Chlamydia pneumoniae surface exposed polypeptide; 
35 (o) a Chlamydia pneumoniae lipoprotein; 

(p) a Chlamydia pneumoniae polypeptide involved in 

lipopolysaccharide biosynthesis; 
(q) a Chlamydia pneumoniae KDO-related polypeptide; 
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(r) a Chlamydia pneumoniae phosphomaimomutase-related 
polypeptide; 

(s) a Chlamydia pneumoniae phosphoglucomutase-related 
polypeptide; 

(t) a Chlamydia pneumoniae lipid A component-related 
polypeptide; 

(u) a Chlamydia pneumoniae polypeptide that contains an RGD 
sequence; 

(v) a Chlamydia pneumoniae Type III secreted polypeptide; 
(w) a Chlamydia pneumoniae cell wall anchored surface 
polypeptide; or 

(x) a Chlamydia pneumoniae polypeptide that is not found in 
Chlamydia trachomatis. 

15 22- A fusion protein encoded by the polynucleotide of Claim 8. 

23- The fusion protein of Claim 22 which immunoreacts with seropositive serum of an 
individual infected with Chlamydia pneumoniae. 

20 24- An antibody that immunospecifically binds to the polypeptide of Claim 19. 

25- An antibody that immunospecifically binds to the fusion protein of Claim 22. 

26- A method for the detection and/or identification of Chlamydia pneumoniae in a biological 
25 sample, compnsmg. ^ ^ sample ^ a polynucleotide primer of Claim 1, 

2, 3, 4, 5, or 6 in the presence of a polymerase enzyme and 
nucleotide's under conditions which permit primer extension; 
and 

(b) detecting the presence of primer extension products in the 
sample in which the detection of primer extension products 
indicates the presence of Chlamydia pneumoniae in the sample. 

27- A method for the detection and/or identification of Chlamydia pneumoniae in a biological 
35 samplccomprising: ^ ^ ^ & ^ o{ C ^ I, 

2, 3, 4, 5, or 6 under conditions which permit hybridization of 
complementary base pairs; and 
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(b) detecting the presence of hybridization complexes in the sample 
in which the detection of hybridization complexes indicates the 
presence of Chlamydia pneumoniae in the sample. 



5 28- A method for the detection and/or identification of Chlamydia pneumoniae in a biological 
sample, comprising: 

(a) contacting the sample with the antibody of Claim 24 under 
conditions suitable for the formation of immune complexes; and 

(b) detecting the presence of immune complexes in the sample, in 
10 which the detection of immune complexes indicates the 

presence of Chlamydia pneumoniae in the sample. 

29- A method for the detection and/or identification of antibodies to Chlamydia pneumoniae 
in a biological sample, comprising: 

15 (a) contacting the sample with a polypeptide of Claim 19 under 

conditions suitable for the formation of immune complexes; and 
(b) detecting the presence of immune complexes in the sample, in 
which the detection of immune complexes indicates the 
presence of Chlamydia pneumoniae in the sample. 

20 

30- A DNA chip containing an array of polynucleotides comprising at least one of the 
polynucleotides of Claim 1, 2, 3, 4, 5, or 6. 

31- A protein chip containing an array of polypeptides comprising at least one of the 
25 polypeptides of Claim 19. 

32- An immunogenic composition comprising the polypeptide of Claim 19 and a 
pharmaceutically acceptable carrier. 

30 33- An immunogeneic composition comprising the polypeptide of Claim 20 and a 
pharmaceutically acceptable carrier. 

34- An immunogenic composition comprising the fusion protein of Claim 22 and a 
pharmaceutically acceptable carrier. 

35 

35- An immunogenic composition comprising the fusion protein of Claim 23 and a 
pharmaceutically acceptable carrier. 



PCMB98/01890 

WO 99/27105 

284 

36- A pharmaceutical composition comprising the polypeptide of Claim 19 and a 
pharmaceutically acceptable carrier. 

37- A pharmaceutical composition comprising the polypeptide of Claim 20 and a 
5 pharmaceutically acceptable carrier. 

38- A pharmaceutical composition comprising the polypeptide of Claim 22 and a 
pharmaceutically acceptable carrier. 

10 39- A pharmaceutical composition comprising the polypeptide of Claim 23 and a 
pharmaceutically acceptable carrier. 

40- A method of immunizing against Chlamydia pneumoniae, comprising: administering to a 
host an immunizing amount of the immunogenic composition of Claim 32. 

15 41- A method of immunizing against Chlamydia pneumoniae, comprising: administering to a 
host an immunizing amount of the immunogenic composition of Claim 33. 

42- A method of immunizing against Chlamydia pneumoniae, comprising administering to a 
20 host an immunizing amount of the immunogenic composition of Claim 34. 

43- A method of immunizing against Chlamydia pneumoniae, comprising: administering to a 
host an immunizing amount of the immunogenic composition of Claim 35. 

25 44- A DNA immunogenic composition comprising the expression vector of Claim 11. 

45- The DNA composition of Claim 44, wherein the DNA composition directs the expression 
of a neutralizing epitope of Chlamydia pneumoniae. 

30 46- A DNA immunogenic composition comprising the expression vector of Claim 12. 

47- The DNA composition of Claim 46, wherein the DNA composition directs the expression 
of a neutralizing epitope of Chlamydia pneumoniae. 

35 48- A screening assay, comprising: 

(a) contacting a test compound with an isolated polynucleotide of 

Claim 1,2, 3, 4, 5 or 6; and 

(b) detecting whether binding occurs. 
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49- A screening assay, comprising: 

(a) contacting a test compound with the polypeptide of Claiml9; 
and 

(b) detecting whether binding occurs. 

5 

50- A screening assay, comprising: 

(a) contacting a test compound with the polypeptide of Claim 22; 
and 

(b) detecting whether binding occurs. 

10 51- A kit comprising a container containing an isolated polynucleotide of Claim 1, 2, 3, 4, 5 
or 6. 

52- The kit of Claim 51 wherein the polynucleotide is a primer or a probe. 

15 53- The kit of Claim 51 wherein the polynucleotide is a primer and the kit further comprises a 
container containing a polymerase. 

54- The kit of Claim 51 which further comprises a container containing deoxynucleotide 
triphosphates. 

20 

55- A kit comprising a container containing an antibody that immunospecifically binds to the 
polypeptide of Claim 19. 

56- A kit comprising a container containing an antibody that immunospecifically binds to the 
25 fusion protein of Claim 22. 
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SEQUENCE LISTING 

<110>Genset SA 

<120>Chlamydia pneumoniae genomic sequence and polypeptides, fragments 

thereof and uses thereof, in particular for the diagnosis, prevention 

and treatment of infection 

<151>1997-11-21 

<160>6849 

<210>1 

<211>1230025 
<212>DNA 

<213>Chlamydia pneumoniae 
<400>1 

atagaaaact attaaaaaat cattgattct gtcgggaaag tatgcggata aaattcagag 60 

agaataagga gaggaagatg acaaggcaga gttatgtttt gggcaattgg aaaatgcaca 120 

aaacaatcca agaagctaaa gagtatgttc aaacattagc ttctntacta caaggagaac 180 

ctctttcctg cactataggc atagcttctc catttacctc tttgagagcg attcatgaga 240 

tgataaacac tacgggagct tttctctggt tgggagcaca aaatgtccat cccgagcttt 300 

cgggtgcttt tactggagaa atttccttac ctatgcttaa ggaggtagga gtggaatttg 360 

ttttagtagg tcactccgag cgtcgtcata tttttggaga gagtgatgcc tttattgctt 420 

caaaggtaaa gtctgtagct caggcgggac tcgtgcctgt tctttgtgtt ggagagagct 480 

tagaagttcg tgaagaggga aaggcgcatc aggtaatcaa aaaacagttg cttttgggat 540 

tggaacagat ggataatggt tccgaatttt tgatcgccta tgaaccagta tgggctatcg 600 

gcacagggaa ggtggcagaa gcttcggatg tgcaagatat tcatatgttt tgtcgtgagg 660 

tagtggcaga gaggttctca gaagctacag ctgaagagat ttcgattttg tacggaggat 72 0 

ctgtgaaggt cgataatgct cagcgatttg ggcaatgtag cgacgtcgat ggtcttttag 780 

ttggcggant tctttagang ggcaaagttt ttttgaagtc gctaaaaatt ttaatgtata 840 

atttgtgaga gttatgagat ttttttgtct attttttctt gggttcctag gatcttttca 900 

ttgtgttgct gaagacaagg gcgtggattt atttggagtc tgggacgata accaaattac 960 

agagtgtgac gatagttaca tgacagaggg tcgtgaagag gttgaaaagg tagtggacgc 1020 

ttagtccatc ggcttttatt tatattctcc ctaaggaagt cctgtattga agatcgcttt 1080 

ctcatagata gaagtaattt tcagatagtc aataattggt ttttttaaga gaatgctagg 1140 

caggtgctcg tgtttgggca tttgattaag tctacatgaa tctggaggga gagattcttc 1200 

tggtattgag aagtagaaca aaaaacaagg atcagacgtt ctccgatgtc ttcctaatcg 1260 

atgtctttaa ataaggagat tggcatgaca gtgttgtttt acgcattttt attcattttc 1320 

ctttttctat gtgtaattct ttgtggctta atcctggttc aagagagtaa gagcatgggg 1380 

ttaggttctt cgttcggcgt ggattctgga gattctgtct ttggtgtctc tactccagat 1440 

attttgaaaa aagtgacttc atngtgtgct gttgctttct gcataggttg tttactactt 1500 

tcattttcca cgaatctctt ggggaaaaag ttagatgcta aagaatttct attgcctgct 1560 

gctgaggaga gcgacactca agcttcttct gagagcgttg aagcagatga atcctagcct 1620 

atttgcggaa ttaggtgttg tctagattga agtgcaataa agctagcaag tttttatctt 1680 

catacgagat atgagtgtac ggtcggataa gagtagaaat ctttcttttg ttcctatggt 1740 

taagaagtcc tttggcttcc ttaaagagta tgactcttat caacccaaga aatgttttag 1800 

atccaagtgc ttgtcgtacg agatttcttc acagagctct gccaagccca tgttcagact 1860 

atgattagac gtttagaata ttacggcagt cctattttaa ggaaaaagtc ttccccaatt 1920 

gcagagatca cagatgagat tcgtaatctc gtgagtgata tgtgtgatac tatggaagca 1980 

catcgtggtg tcggtttagc cgctcctcag gtagggaaaa acgtcagttt atttgtcatg 2040 

tgtgtagata gagagactga ggatggagag ttgattttct ctgagtctcc gagggtattt 2100 

atcaatcctg ttctatcaga tccttctgaa accccgatca taggtaaaga aggatgtctt 2160 

tctattcctg gattgcgagg agaagtattc cgccctcaga aaatcacagt gaccgctatg 222 0 

gatctcaatg gtaaaatatt tactgagcac ttggaaggat tcactgcacg tatcattatg 2280 

cacgagactg accatctgaa tggagttctc tatattgatc ttatggaaga acccaaagat 2340 

cctaaaaaat ttaaagcctc tttagagaag atcaaacgtc gctacaatac acacttgagt 2400 

aaagaagaac tagtttctta attgctcttc agtctgatgt aggtgatatt ttcttgtctc 2460 

ttgcgtcaca tttgttgtca gctttgctta tttccccgaa caaatttcgt caaaggtttt 2520 

aaaatgtgtc ttgctgattt ttgctaagag ctctttccct cgttgcttag cgatctctct 2580 

tcctgctgct ttgacattga atccagcacc tttaggaagc tgtacttgat attgttcttc 2640 

caacttctgt atcgactgta caaatgcatc tctagccaat atagaagctg ctgctacgac 2700 

tacatcttgt tctgcacgtg gcttttgtat taaagtaata tcggtttctt ttttttgaag 2760 

tgctttgagt agggtgtatt ctgaagctgc aaactgatct gaaatagcaa agacatctcc 2 82 0 

tgcaggtttg ggtgctaagt tgttgataac agtagcgtgg gcccaagcaa gaagtgtatt 2 880 

taaattctgg aatttcccat atagctcgtt atatttttct gggtatagaa tgatgacatc 2940 

gcagacacat agtgagcgta tgatacgtgc taaagaagcg attttcgtgt ctttgagatt 3000 

tttagagtct tggactttat tctcatagag tttttttaag atctctgcat tcgatgcata 3060 

gactgccgca atacataaag ggccaaaaaa atcacctttc cctgattcat cgactcccaa 3120 
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